Abstract

Imagine  in  a  social  network  that  an  agent  wants  to  form  a  connection  with  a  target 
user(s)  by  interacting  with  the  friends  of  the  target(s).  Because  forming  a  connection 
is  known  as  following  in  social  networks  such  as  Twitter,  we  refer  to  this  as  the  follow- 
back  problem.  The  friends  of  the  target  user(s)  form  a  directed  graph  which  we  refer 
to  as  the  “friends  graph.”  The  agent’s  goal  is  to  get  the  target  to  follow  him,  and  he 
is  allowed  to  interact  with  the  target  and  the  target’s  friends.  To  understand  what 
features  impact  the  probability  of  an  interaction  resulting  in  a  follow-back,  we  conduct 
an  empirical  analysis  of  several  thousand  interactions  in  Twitter.  We  build  a  model 
of  the  follow-back  probabilities  based  upon  this  analysis  which  incorporates  features 
such  as  the  friend  and  follower  count  of  the  target  and  the  neighborhood  overlap  of 
the  target  with  the  agent.  We  then  use  this  model  to  solve  the  follow-back  problem. 
We  find  optimal  policies  for  simple  network  topologies  such  as  directed  acyclic  graphs 
and single-cycle graphs. For arbitrary directed graphs we develop heuristics based
upon a graph score measure that we define as the follow-back score. We show through
simulation that these heuristic policies perform well on real Twitter graphs.
Chapter 1


Introduction 

1.1  Business  Motivation 

Social  networks  have  rapidly  risen  in  size  over  the  past  decade  and  today  represent 
one  of  the  main  platforms  through  which  social  interactions  occur.  Current  estimates 
are  that  74%  of  adult  Internet  users  are  on  social  media  and  in  many  other  parts  of  the 
world,  including  emerging  markets,  this  penetration  rate  is  even  higher  [28].  Today 
people  use  social  networks  as  the  primary  mechanism  to  interact  with  friends,  make 
business  connections,  and  find  like-minded  individuals.  Therefore,  it  is  important  to 
understand  how  to  effectively  engage  with  users  in  a  social  network. 

As  a  specific  situation,  consider  an  individual  or  company  trying  to  establish  its 
presence  in  a  new  market.  One  way  to  build  its  brand  recognition  is  to  advertise  over 
social  media.  Presumably,  there  is  a  group  of  influential  individuals  in  this  market 
that  already  have  well  established  social  media  presences.  The  company  that  is  new 
to  the  market  may  want  to  connect  with  these  influential  users  to  tap  into  their  well- 
established  networks.  What  is  the  best  way  to  do  this  over  social  media?  In  the 
off-line  setting,  humans  understand  the  basics  of  social  interaction.  For  instance,  it 
is  harder  to  connect  with  very  popular  people  versus  less  popular  people.  Also,  it  is 
easier to connect with someone if you share mutual contacts. We wish to understand
whether these social intuitions translate to the online setting and, if so, how one
can act on them. Such an understanding would allow for more effective
interactions in social networks. People could more easily connect with users they
deem  interesting.  Companies  and  organizations  could  engage  with  more  customers 
and  grow  their  online  social  networks  more  quickly.  Individuals  could  systematically 
gain  influence  and  followers  in  social  networks.  In  short,  if  we  understand  what  drives 
online  interactions,  we  can  develop  policies  to  make  us  more  effective  users  of  social 
networks. 


1.2  National  Defense  Motivation 

With its rapid growth, social media has gained the interest of national defense policy
makers and strategists. In January of 2016, the White House held a summit with
executives from some of the leading American technology firms, including Facebook
and Twitter, to help generate strategies to counter extremist recruitment online [34].
Defense Secretary Ash Carter met with officials from Facebook in April of 2015 [3].

Anti-Access/Area Denial (A2/AD) is a defense paradigm used to describe a strategy
of preventing one’s adversary from operating military forces within a specific
region of interest. United States defense strategists consider the A2/AD capability
posed by American adversaries to be one of the greatest threats of the twenty-first
century [29]. An adversary can also pose an A2/AD threat over the information
space, which encompasses both physical and non-physical domains where information
is spread [29]. This is a threat that the United States has increasingly seen.

The  Chinese  Communist  Party  has  recognized  the  value  of  popular  content  on 
social  media.  It  employs  thousands  of  social  media  operators,  or  “wumao  dang,”  to 
share  material  directed  by  the  state  [21].  These  accounts  help  draw  attention  to 
topics the party wants to highlight [21]. By intentionally inundating a social media
platform, this tactic can hinder the free movement of alternative ideas and information
and thus can be analyzed through the A2/AD paradigm. Non-state adversaries have
employed  similar  strategies. 

In summer 2014, the Islamic State in Iraq and the Levant (ISIL) launched an
application on the Google Play Store called “The Dawn of Glad Tidings.” If a Twitter
user downloaded this application, his account could be used by ISIL to post content
[6]. The idea behind the application was simple: use existing Twitter accounts of
jihadist sympathizers to amplify the message of the terror group. However, the
consequences were far greater. By hijacking Twitter accounts to broadcast its message,
ISIL built a quasi-active community of supporters on the social media site. An
interested onlooker would perceive a network of Twitter users engaging one another
and posting content sympathetic to the jihadist group’s cause, even while much of
this activity was automated. ISIL social media recruiters also deliberately attempt
to  isolate  their  targets  from  the  rest  of  the  world  by  encouraging  them  to  terminate 
their  existing  social  media  connections  [9].  ISIL  attempted  to  restrict  the  flow  of 
alternative  information  to  its  target  audience:  it  denied  its  prospective  recruits  free 
access  to  information.  ISIL  recruits  by  framing  a  perverse,  inflammatory  narrative 
about  the  tenets  of  Islam  and  the  West’s  subjugation  of  the  Muslim  world  [6].  It 
can  recruit  more  easily  where  opposing  voices  are  absent.  It  relies,  in  part,  upon  an 
A2/AD  capability  to  block  counter-propaganda  efforts. 

The  United  States  Department  of  Defense  uses  Military  Information  Support 
(MIS)  units  to  penetrate  contested  information  environments.  Military  Information 
Support  Operations  (MISO)  are  the  conveying  of  specific  information  to  target  foreign 
audiences  in  order  to  sway  their  perceptions  and  ultimately  influence  their  actions  in 
support  of  national  objectives  [1].  As  social  media  platforms  have  made  the  spread  of 
information  more  nuanced,  MIS  units  require  tools  that  will  allow  them  to  penetrate 
this  new,  increasingly  important  space,  in  the  event  that  an  adversary  employs  an 
A2/AD  capability.  When  an  adversary  is  attempting  to  use  propaganda  to  influ¬ 
ence  a  target  audience  over  social  media,  MIS  units  may  be  tasked  with  inserting 
counter-propaganda  information  in  the  network. 

One  problem  confronting  MIS  units  is  ensuring  that  their  counter-propaganda 
efforts can penetrate an information blockade over social media. On Twitter and
other social media platforms, for instance, a user will generally only see the content
posted by users whom he “follows” or is connected with. MIS units can improve their
ability to penetrate contested social media platforms by learning how to form desired
connections. We study strategies that can be used by an agent when it desires to
form  a  connection  with  a  target  user  or  users  and  call  this  the  follow-back  problem. 

The  author  does  not  advocate  that  the  US  Government  come  even  remotely  close 
to  mirroring  the  tactics  employed  by  our  adversaries.  The  United  States  should 
never  compromise  our  commitment  to  free  press.  However,  US  policymakers  must  be 
realistic  in  recognizing  that  all  of  our  potential  adversaries  might  not  conform  to  this 
value.  If  and  when  one  of  these  adversaries  uses  an  A2/AD  capability  against  us  in 
the  rapidly  changing  information  domain,  we  must  be  able  to  penetrate  the  blockade 
and  deny  them  any  tactical  advantage. 


1.3  Our  Contribution 

We study the follow-back problem, where the goal is to find the most effective way
to  interact  with  users  in  a  social  network  in  order  to  form  connections  with  them.  The 
name is based on terminology from the social network Twitter, where a user who
forms such a connection is said to be a follower. We begin by formally defining
the  follow-back  problem  in  mathematical  terms.  The  problem  definition  assumes 
some  model  for  user  behavior  in  social  networks,  in  particular,  what  features  affect 
the  probability  of  being  followed.  To  better  understand  this  model,  we  conduct  an 
online field experiment in Twitter where online agents interact with users in order to
gain  followers.  Our  experimental  findings  show  that  the  follow  probability  is  affected 
by  the  number  of  friends  and  followers  of  the  targeted  user  and  also  the  number  of 
friends  of  the  target  user  that  follow  the  agent,  a  feature  we  refer  to  as  the  overlap.  In 
particular,  we  find  that  the  follow  probability  is  larger  if  the  targeted  user  has  many 
friends,  few  followers,  and  a  large  overlap  with  the  agent.  These  observations  match 
our social intuition, which suggests that it is easier to connect with people who are less
popular  (many  friends,  few  followers)  and  who  have  mutual  contacts  (large  overlap). 

We  propose  a  simple  model  for  the  follow  probability  based  on  this  empirical 
analysis  and  use  this  model  in  our  analysis  of  the  follow-back  problem.  We  are  able 
to  provide  the  optimal  interaction  policy  for  directed  acyclic  graphs  (DAGs)  and 
provide an approximation for the expected number of follows this policy gives on a
DAG, which we define as the DAG’s follow-back score. We use the follow-back score to
design  two  different  algorithms  that,  given  an  arbitrary  directed  graph,  try  to  find  a 
good  candidate  directed  acyclic  subgraph.  We  show  that  our  heuristics  perform  well 
on  different  real  and  synthetic  social  media  graphs. 

The thesis is outlined as follows. We review related literature in Section 1.4 and formally
define the follow-back problem in Section 1.5 of this chapter. We then present our empirical
analysis of follow probabilities in Chapter 2. We analyze optimal policies
for the follow-back problem and define the follow-back score in Chapter 3. Next, we discuss
our algorithm for a graph in which there is a single target and present results for this
algorithm in Chapter 4. Then, we introduce the general follow-back problem for a graph with
multiple targets and show how constrained optimization is used to find a policy in Chapter 5.
We explain some avenues of future work and conclude in Chapter 6.

In this thesis, we propose the follow-back algorithm to maximize the likelihood of
connecting with, or being “followed” by, selected user(s) on the social network
Twitter. The main contributions of our work are:

•  We  believe  that  our  research  is  the  first  to  formalize  a  problem  of  connecting 
with  a  desired  user  or  group  of  users  over  social  media. 

•  We run field experiments on Twitter and present features of a user that impact
his  propensity  to  connect,  or  “follow.” 

•  We  use  the  insights  from  our  experiments  to  provide  algorithms  that  social 
media  operators  can  use  to  increase  the  chances  of  gaining  desired  connections. 


1.4  Literature  Review 

There  are  several  lines  of  research  related  to  our  work.  First,  there  is  work  on  empirical 
studies  of  influence  in  social  networks.  Second,  there  is  the  body  of  work  on  the 
sociological  notion  of  triadic  closure  and  its  manifestation  in  online  social  networks. 
Third,  there  is  a  sizable  body  of  work  on  theoretical  analysis  of  influence  maximization 
in social networks. Fourth, there is a well-studied problem called the maximum acyclic
subgraph  problem.  Finally,  there  is  the  broad  area  of  network  centrality  measures. 

Many  researchers  have  studied  what  factors  affect  influence  in  social  networks. 
Influence  can  be  roughly  defined  as  the  ability  to  cause  an  individual  to  adopt  a  new 
product  or  behavior.  The  ability  of  social  influence  to  affect  health  behavior  was 
demonstrated  in  [14]  and  [15].  The  effect  of  the  local  structure  of  a  social  network  on 
health  behavior  was  demonstrated  in  [10].  The  use  of  randomized  online  experiments 
to discern causal social influence was done in [4]. In [5] a massive randomized online
experiment in Facebook was carried out to infer what types of users are influential and
susceptible. It was found that influence and susceptibility depended on demographic
characteristics as well as network structure. [27] analyzes the Twitter network
structure as well as the temporal aspects and network structure of retweets. [11] analyzes
the influence of users in a network based upon number of retweets and number of
followers. [22] showed that data from Facebook could be used to understand
structural features of networks to better understand social contagion. [5] used Facebook
data to try to infer which users are classified as influential and which are classified
as susceptible to being influenced based upon demographic characteristics as well as
network structure. [4] showed that data from randomized interactions in
social media are valuable in understanding influence.

The overlap feature which we present in Section 2.5 can be considered as a directed graph
version  of  the  notion  of  triadic  closure  from  sociology,  or  “directed  closure”  [16].  A 
graph  theoretic  view  of  triadic  closure  is  provided  in  [17].  This  principle  is  used  in 
[8]  to  design  a  social  botnet  on  Facebook.  The  algorithm  in  [8]  strives  to  infiltrate 
a  Facebook  network  by  leveraging  the  principle  of  triadic  closure.  They  initially 
acquire  friends  from  a  set  of  vulnerable  users  and  then  snowball  their  engagements 
by  identifying  users  with  whom  they  have  mutual  connections.  The  effect  of  social 
overlap on influence was shown in [22]. Here a massive Facebook dataset was studied
and  it  was  found  that  adoption  of  Facebook  by  a  new  user  depended  strongly  on  how 
many  people  invited  that  user  and  their  local  network  structure.  A  similar  result  is 
found  in  [18]  where  it  is  shown  that  the  likelihood  of  accepting  a  connection  request 
in an online social network is about three times higher given the existence of some
number  of  mutual  connections. 

Related  to  the  follow-back  problem  is  the  influence  maximization  problem  where 
the  goal  is  to  maximize  the  spread  of  a  piece  of  information  through  a  social  network 
by  seeding  a  select  set  of  users.  One  of  the  first  theoretical  studies  of  the  influence 
maximization  problem  was  in  [24]  where  it  was  shown  that  the  problem  was  sub- 
modular,  so  a  greedy  algorithm  would  have  good  performance.  This  work  led  to 
subsequent  variations  of  the  problem  and  different  algorithmic  solutions  [25],  [12], 
[13].  Each  of  these  works  exploited  the  sub-modular  structure  of  the  problem  to 
obtain  efficient  algorithms. 

The  study  of  Directed  Acyclic  Graphs  (DAGs)  overlaps  with  the  study  of  partially 
ordered  sets,  since  a  partial  order  may  be  imposed  on  the  nodes  of  a  DAG.  [33] 
discusses  the  process  of  “refining”  a  partial  order.  Finding  a  directed  acyclic  subgraph 
for  an  arbitrary  directed  graph  that  is  the  most  refined  partial  order  on  the  graph 
possible  is  deemed  the  maximum  acyclic  subgraph  problem.  It  seeks  to  select  the 
maximum  number  of  edges  from  the  directed  graph  subject  to  the  induced  subgraph 
being  acyclic  [35].  This  problem  is  generally  NP-hard. 

Many  problems  in  graphs  can  be  solved  through  the  development  of  functions 
which map vertices to real numbers. These functions are known as network centralities.
They quantify how central a vertex is to the problem at hand, with the definition
of centrality being set by the problem at hand. In our work we use centralities
both to compare the performance of our algorithms and to supplement our
algorithms themselves. One of the simplest is distance centrality, which measures how
close a vertex is to all other vertices [31]. A related measure is betweenness centrality,
which measures how many paths pass through a vertex [19], [20]. This measure gives
importance to vertices that are critical to the flow through a graph. Eigenvector
centrality measures how important a vertex is by how important its neighbors are
[7]. This centrality measure is closely related to the PageRank [30] and HITS [26]
centrality measures. Rumor centrality was developed to find the source of a rumor
that spreads in a graph [32]. It was shown that the vertex with the highest rumor
centrality corresponded to the maximum likelihood estimate of the rumor source in
some  specific  graph  topologies  and  that  rumor  centrality  was  related  to  the  number 
of  linear  extensions  of  a  partially  ordered  set. 


1.5  The  Follow-back  Problem 

Initially at time $n = 0$ we have a primary target vertex $t$, a set of vertices $t$ follows,
which we refer to as the friends of $t$, and an agent vertex $a$. The friends form the
friends graph $G_0 = (V, E_0)$ where $E_0$ is the initial edge set. A directed edge $(u, v)$
pointing from $u$ to $v$ for some $u, v \in V$ in the friends graph means that $u$ is followed
by $v$. In a social network such as Twitter this means that any content $u$ posts will
be visible to $v$. $V$ is the fixed vertex set of the graph, which we define as including
the friends, $t$, and $a$. Each vertex $v$ has a set of features $F_v$ associated with it and
we define the set of all vertex features as $F = \bigcup_{v \in V} F_v$. For the friends we have that
$(v, t) \in E_0 \ \forall v \in V \setminus \{a\}$. This is because initially $t$ follows all vertices in $V$ except for
$a$. We will assume that $a$ initially has no edge with any vertex in $V$. The goal in
the follow-back problem is for the agent $a$ to have the primary target $t$ follow him by
interacting with the vertices in $V$. In other words, $a$ wants to create the edge $(a, t)$.
We illustrate the setting of the follow-back problem in Figure 1-1.

At each time $n$, $a$ can interact with a single vertex in $V$. The vertex chosen by $a$ at
time $n$ is denoted $u_n$. The interaction of the agent with a target vertex is some form of
social network interaction. In Twitter these interactions include following, retweeting,
and mentioning. We require that $u_n \neq u_i$ for any $i < n$, meaning that $a$ can only
interact with each $v \in V$ once. We define the interaction policy of $a$ as the vertex
sequence $\pi = \{u_1, u_2, \ldots\}$ along with the sequence of interactions $x = \{x_{u_1}, x_{u_2}, \ldots\}$.
Here $x_{u_i}$ specifies the type of Twitter interaction (following, retweeting, or mentioning)
with $u_i$. When at time $n$ the agent engages with $u_n$, either $u_n$ will follow $a$ or $u_n$ will
not follow $a$. We let the random variable $X_{u_n}$ be one if the engagement of $a$ results in
$u_n$ following $a$, otherwise $X_{u_n}$ is zero. The vertex set does not change, but the actions
of $a$ can change the edge set $E_n$. If the agent’s action $x_{u_n}$ gains a follow from vertex $u_n$,
then we update the graph to $G_n = (V, E_n)$ where $E_n = E_{n-1} \cup \{(a, u_n)\}$. Furthermore,
we assume that a target cannot follow the agent without being interacted with first
and that once the target follows the agent, he cannot “unfollow.” This means that
edges can only be formed in response to an interaction by the agent and edges cannot
be destroyed.

Figure 1-1: Illustration of the setting of the follow-back problem. The agent $a$ wants
to be followed by the primary target $t$ by interacting with the vertices in the friends
graph.

The state at time $n$ is the graph $G_n$ and the vertex feature set $F$. Denote
the follow probability of a vertex $u_n$ in response to the agent interaction $x_{u_n}$ as
$P(X_{u_n} = 1 \mid G_n, x_{u_n}, F)$. We are assuming here that the follow probability of $u_n$ only
depends upon the current state of the graph, the type of interaction, and some fixed
vertex features.

In the follow-back problem, the agent selects the policy $(\pi, x)$ to maximize the
probability that the target $t$ will follow back. That is, the agent chooses a policy
$(\pi, x)$ to solve the following problem:

$$\max_{(\pi, x)} \mathbf{E}\left[X_t \mid \pi, x, G_0, F\right]. \tag{1.1}$$

We will see that the solution to this problem depends upon understanding the structure
of the follow probabilities which characterize the interaction dynamics. Our
empirical analysis in Chapter 2 will show how these follow probabilities depend upon
various features of the vertices and the friends graph.
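
To make the dynamics behind problem (1.1) concrete, the following is a minimal simulation sketch, not the method developed in later chapters. It fixes a single interaction type, so the policy is just the vertex sequence. The function names and the rule inside `toy_follow_prob` are hypothetical and chosen only for illustration; the edge convention (an edge (u, v) means u is followed by v) is the one defined above.

```python
import random
from typing import Callable, Hashable, Sequence, Set, Tuple

Edge = Tuple[Hashable, Hashable]  # (u, v) means u is followed by v
FollowProb = Callable[[Hashable, Hashable, Set[Edge]], float]


def simulate_policy(agent: Hashable, policy: Sequence[Hashable],
                    initial_edges: Set[Edge], follow_prob: FollowProb,
                    rng: random.Random) -> Set[Edge]:
    """Run one sample path of the interaction process; return the final edge set."""
    if len(set(policy)) != len(policy):
        raise ValueError("each vertex may be interacted with at most once")
    edges = set(initial_edges)
    for u in policy:
        # The follow indicator of u is one with the follow probability
        # evaluated on the current graph.
        if rng.random() < follow_prob(agent, u, edges):
            edges.add((agent, u))  # u now follows the agent; edges are never removed
    return edges


def estimate_target_follow(agent, target, policy, initial_edges,
                           follow_prob, runs=2000, seed=0) -> float:
    """Monte Carlo estimate of the probability that the target follows the agent."""
    rng = random.Random(seed)
    hits = sum(
        (agent, target) in simulate_policy(agent, policy, initial_edges, follow_prob, rng)
        for _ in range(runs)
    )
    return hits / runs


def toy_follow_prob(agent, u, edges) -> float:
    # Hypothetical rule, not fitted to data: a base rate plus a bonus for every
    # friend of u that already follows the agent.
    friends_of_u = {w for (w, v) in edges if v == u}
    agent_followers = {v for (w, v) in edges if w == agent}
    return min(1.0, 0.1 + 0.3 * len(friends_of_u & agent_followers))


if __name__ == "__main__":
    edges0 = {("f1", "t"), ("f2", "t")}  # t follows f1 and f2
    print(estimate_target_follow("a", "t", ["t"], edges0, toy_follow_prob))
    print(estimate_target_follow("a", "t", ["f1", "f2", "t"], edges0, toy_follow_prob))
```

Under a rule of this kind, where followers gained from the friends raise the target's follow probability, the order in which vertices are visited matters, which is what makes the choice of policy non-trivial.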
Chapter 2


Empirical  Analysis  of  Interactions  in 
Social  Networks 


To  determine  the  structure  of  the  follow  probabilities  we  require  data  on  interactions 
in  social  networks.  To  obtain  such  data  we  conducted  an  online  experiment  involving 
user  accounts  to  which  we  were  given  access.  The  accounts  belonged  to  six  Moroccan 
artists  who  were  interested  in  connecting  with  Twitter  users  interested  in  Moroccan 
art.  The  artist  accounts  would  interact  with  Twitter  users  using  different  types  of 
interactions, and we recorded the results of the interactions after one week. This was
to allow time for the targets to react, as their reactions are often not
immediate. We were able to obtain data for over 6,000 Twitter interactions. We now
discuss  our  empirical  findings  in  detail. 

2.1  Empirical  Setup 

We  conducted  a  series  of  experiments  to  understand  different  aspects  of  interactions 
in  social  networks.  As  mentioned  above,  we  utilized  six  different  Twitter  accounts 
from  Moroccan  artists.  These  accounts  acted  as  our  agents.  We  conducted  three 
different  experiments.  The  first  experiment  was  designed  to  understand  the  effect  of 
the  different  types  of  interactions  in  Twitter  on  the  follow  probability.  The  second 
experiment  aimed  to  understand  how  vertex  features  such  as  friend  count  and  follower 
count affect the follow probabilities. The third experiment looked at more complex
graph  features,  such  as  the  number  of  the  agent’s  followers  who  are  also  friends  of  the 
target,  which  we  refer  to  as  the  overlap. 
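
As a small illustration of the overlap feature, the sketch below counts the agent's followers who are also friends of the target, using the edge convention of Section 1.5 (an edge (u, v) means u is followed by v). The function name `overlap` and the example vertices are hypothetical.

```python
from typing import Hashable, Iterable, Tuple

Edge = Tuple[Hashable, Hashable]  # (u, v) means u is followed by v


def overlap(agent: Hashable, target: Hashable, edges: Iterable[Edge]) -> int:
    """Number of the agent's followers who are also friends of the target."""
    edges = list(edges)
    agent_followers = {v for (u, v) in edges if u == agent}  # users who follow the agent
    target_friends = {u for (u, v) in edges if v == target}  # users the target follows
    return len(agent_followers & target_friends)


if __name__ == "__main__":
    # t follows f1 and f2; f1 already follows the agent a.
    example_edges = {("f1", "t"), ("f2", "t"), ("a", "f1")}
    print(overlap("a", "t", example_edges))
```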

The  empirical  procedure  was  as  follows.  For  the  first  two  experiments  we  had  the 
agents  use  the  Twitter  search  API  to  find  target  users  who  have  posted  tweets  about 
Morocco  or  art.  The  agents  then  interacted  with  all  of  these  targets  in  some  way. 
We  had  the  agents  perform  this  search  and  interact  procedure  in  randomly  spaced 
intervals  that  were  on  average  two  hours  apart.  The  experiments  were  each  run  for  one 
week.  We  waited  one  week  from  when  the  interaction  occurred  and  checked  if  there 
was  any  sort  of  reaction  from  the  targets.  For  the  third  experiment  the  procedure 
was  the  same,  except  to  find  target  users,  in  addition  to  using  the  search  API,  we 
also searched for followers of users who followed back the agents during the first two
experiments. 

We made sure that each target user only interacted with a single Moroccan artist
account to avoid the risk of a target’s reaction being confounded by multiple
interactions. Also, each target user was only interacted with one time. The six different
Twitter  agent  accounts  we  used  have  different  names  and  profile  images,  but  the 
content  they  posted  on  their  Twitter  Timelines  is  similar.  This  content  consisted  of 
images  of  their  artwork,  which  was  similar  across  the  different  accounts. 

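We modeled the follow probabilities using a logistic regression model. We denote
the follow probability of a target vertex $v$ as $P(X_v = 1 \mid G, x, F) = p(S_v)$, where $S_v$
is the set of all data associated with the interaction. Under our logistic regression
model the follow probability for a vertex $v$ is

$$p(S_v) = \frac{\exp\left(\beta_0 + \sum_{i=1}^{d} \beta_i s_{v,i}\right)}{1 + \exp\left(\beta_0 + \sum_{i=1}^{d} \beta_i s_{v,i}\right)}, \tag{2.1}$$

where $\beta_0, \ldots, \beta_d$ are the regression coefficients and we have assumed that for a
vertex $v$ the set $S_v$ has $d$ elements $s_{v,1}, \ldots, s_{v,d}$.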
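
A logistic regression model passes a linear combination of the features through the logistic function. The sketch below evaluates that function for a given intercept, coefficient vector, and feature vector. The function name and argument layout are assumptions made for illustration, and the numbers in the example call are arbitrary rather than fitted values.

```python
import math
from typing import Sequence


def follow_probability(beta0: float, betas: Sequence[float],
                       features: Sequence[float]) -> float:
    """Logistic-regression probability for one interaction with d features."""
    if len(betas) != len(features):
        raise ValueError("need exactly one coefficient per feature")
    z = beta0 + sum(b * s for b, s in zip(betas, features))
    # Evaluate the logistic function in a numerically stable way for large |z|.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


if __name__ == "__main__":
    # Arbitrary illustrative numbers, not estimates from the experiments.
    print(follow_probability(-1.0, [0.5, -0.25], [2.0, 4.0]))
```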
2.2 Interaction Type


There  are  three  main  ways  an  agent  can  interact  with  a  target  in  Twitter.  The  first 
is  following.  When  the  agent  follows  a  target  any  content  the  target  posts  is  shown 
to  the  agent  in  his  Twitter  timeline.  In  addition,  when  the  agent  follows  the  target, 
the  target  is  notified  that  the  agent  is  now  following  him.  The  second  interaction  is 
retweeting,  which  is  reposting  one  of  the  target’s  tweets.  The  agent’s  timeline  displays 
the  original  tweet,  and  the  target  is  notified  of  the  retweet.  The  third  interaction  is 
replying,  which  involves  the  agent  creating  a  tweet  that  mentions  the  target.  The 
tweet  appears  on  the  agent’s  timeline  and  the  target  is  notified. 

We  had  one  agent  account  post  original  content  and  not  interact  with  other  users. 
This  agent  acted  as  a  control.  The  remaining  five  agents  interacted  with  150  to  220 
different  targets  over  a  one  week  period.  Each  agent  was  assigned  a  specific  type  of 
interaction.  These  interactions  were  following,  retweeting,  replying,  retweeting  and 
following,  and  replying  and  following.  Agents  who  replied  used  a  fixed  set  of  messages 
such  as  “I  like  that”  or  “Nice”. 

The  target  can  react  to  the  agent’s  interaction  by  mentioning,  retweeting,  or 
following.  If  the  agent’s  interaction  produces  a  reaction  from  the  target,  we  refer 
to  this  as  a  conversion.  We  show  the  conversion  rate  for  each  combination  of  agent 
interaction  and  target  reaction  in  Figure  2-1.  The  control  agent  that  did  not  interact 
with  anyone  is  referred  to  as  “None”  in  the  figure.  This  agent  did  not  receive  any 
form  of  target  reaction.  One  observation  from  this  figure  is  that  to  gain  followers,  the 
agent  must  interact  with  others. 

We run our regression model for the target reaction of following using three indicator
variables for following, retweeting, and replying. Table 2.1 shows that the only
interaction  type  that  was  significant  (at  the  5%  level)  in  predicting  the  follow  reaction 
was  following,  which  increased  the  follow  probability.  This  analysis  shows  that  for 
the  follow-back  problem,  the  best  interaction  to  use  is  following. 
Figure 2-1: Conversion rate for different agent interaction and target reaction
combinations (RT stands for retweet). Horizontal axis: agent interaction (Follow,
None, Reply, Reply & Follow, RT, RT & Follow).


Table 2.1: Interaction Type

Feature      $\beta$      p-value
Intercept    -4.18***     $3.25 \times 10^{-15}$
Follow       2.14***      $1.25 \times 10^{-5}$
Retweet      -0.14        0.68
Reply        -0.22        0.48

The significance codes are *** : 0.001, ** : 0.01, and * : 0.05.
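
To read the coefficients in Table 2.1 as probabilities, one can push them through the logistic function. The sketch below does this for the five interaction combinations assigned to the agents. The dictionary and function names are hypothetical, and the coefficients are copied from the table as printed.

```python
import math

# Coefficients copied from Table 2.1.
COEF = {"Intercept": -4.18, "Follow": 2.14, "Retweet": -0.14, "Reply": -0.22}


def implied_follow_probability(follow: bool = False, retweet: bool = False,
                               reply: bool = False) -> float:
    """Fitted follow probability for a combination of interaction indicators."""
    z = (COEF["Intercept"]
         + COEF["Follow"] * follow
         + COEF["Retweet"] * retweet
         + COEF["Reply"] * reply)
    return 1.0 / (1.0 + math.exp(-z))


if __name__ == "__main__":
    combos = {
        "Follow": dict(follow=True),
        "RT": dict(retweet=True),
        "Reply": dict(reply=True),
        "RT & Follow": dict(retweet=True, follow=True),
        "Reply & Follow": dict(reply=True, follow=True),
    }
    for name, flags in combos.items():
        print(f"{name:15s} {implied_follow_probability(**flags):.3f}")
```

With these coefficients, following alone gives a fitted probability of roughly 0.12, while retweeting or replying alone gives roughly 0.01, which matches the conclusion that following is the interaction that matters for gaining a follow.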

2.3  Interaction  Type,  Follower  Counts,  Friend  Counts 


We next run a regression on the same data set as above but include the agent
and target friend and follower counts. Table 2.2 shows that, after adding friend and
follower counts to the regression, the significance of the interaction types decreases, as
the target friend and follower counts are the features with the highest significance. This
finding implies that social media interactions may be too complicated a phenomenon
for simple indicator variables to predict responses adequately. People respond very
differently to the varying forms of interaction available over social media, both verbal
and non-verbal. However, the fact that the agent who did not initiate interaction
with anyone gained no followers supports the intuitive idea that interacting in some
manner is best if one wants to gain connections. An isolated user who avoids
interaction is poorly placed to establish a network. In practice, to most effectively
gain followers, a social media operator would want to tailor each interaction so that
Table 2.2: Interaction Type

Feature                        β             p-value
Intercept                      -10.6994***   1.31 x 10^-8
log10(target friend count)       3.4066***   6.28 x 10^-10
log10(target follower count)    -1.9575***   1.03 x 10^-5
log10(agent friend count)        0.2954      0.7634
log10(agent follower count)      1.4605      0.2883
Follow                           1.6365*     0.0211
Retweet                         -0.7689      0.1096
Reply                           -0.9330*     0.0238

The significance codes are *** : 0.001, ** : 0.01, and * : 0.05.


it has the greatest chance of winning the favor of the target. We further investigate
the significance of follower and friend counts in Section 2.4.


2.4  Friend  and  Follower  Counts 

Our  next  experiment  aimed  to  understand  how  the  friend  and  follower  counts  of  both 
the  target  and  agent  impacted  the  follow  probability.  Based  on  our  study  of  different 
Twitter  interactions,  we  decided  to  have  the  agents  only  follow  and  retweet  the  targets. 
Prior  to  this  experiment,  the  agents  obtained  followers  from  an  online  service.  This 
allowed  for  sufficient  variance  in  their  follower  counts,  with  one  agent  receiving  as 
many  as  10,000  followers.  These  follower  accounts  appeared  to  be  automated  bot 
accounts.  However,  our  interest  was  in  changing  the  follower  count  displayed  on  the 
agent’s  profile,  not  in  the  quality  of  the  followers  themselves.  We  wanted  to  see 
if  having  a  higher  follower  count  would  make  the  agent  seem  more  important  and 
increase  the  follow  probability.  We  were  able  to  measure  over  2,000  interactions  for 
the  agents. 

We  first  examine  the  impact  of  the  agent  follower  count  on  the  follow  probability. 
In  Figure  2-2  we  plot  the  conversion  rate  for  follows  versus  the  follower  count  of 
the  agent.  The  results  indicate  that  the  follower  count  of  the  agent  does  not  have 
a  significant  impact  on  the  conversion  rate  for  follower  counts  up  to  10,000.  This 
is somewhat counterintuitive, as we would expect users to be more likely to follow

Figure 2-2: Plot of the follow conversion rate versus agent follower count. The
horizontal axis shows the follower counts of four agents. The vertical axis shows the
percentage of targets that followed each agent.


someone with a higher follower count, as this is a symbol of status on Twitter. Our
data do not show a clear effect of the agent’s follower count on the follow
probability.

The follower and friend counts of the target had a clearer relationship to the
follow probability. We plot in Figure 2-3 the friend and follower count of each target.
We use different markers for those that followed the agent and those that did not.
A clear pattern is seen here. Targets with lower follower counts and higher friend
counts tended to follow the agents more frequently. This makes sense intuitively, as
we would assume that users with higher friend-to-follower ratios have a propensity
to follow others on Twitter.

Our regression model used the logarithm (base 10) of the friend and follower
counts of the agent and targets as features. The results of our model fitting are shown
in Table 2.3 and support what was seen in the figures. The follower and friend counts
for the target were significant while the friend and follower counts of the agents were
not. Also, targets with many friends and few followers were more likely to follow the
agent.

Figure 2-3: Plot of the follower count versus the friend count for each target. The
markers indicate which targets followed the agent and which did not.


Table 2.3: Follower and Friend Counts

Feature                        β           p-value
Intercept                      -2.641***   < 2 x 10^-16
log10(Target Friend Count)      3.43***    6.28 x 10^-10
log10(Target Follower Count)   -2.0***     1.03 x 10^-5
log10(Agent Follower Count)     1.5        0.29
log10(Agent Friend Count)       0.30       0.76

The significance codes are *** : 0.001, ** : 0.01, and * : 0.05.

2.5  Overlap 


Our  final  experiment  looked  at  the  overlap  feature,  which  is  related  to  the  concept 
of  triadic  closure  from  social  network  theory.  Triadic  closure  is  the  property  that 
if  two  individuals  A  and  B  both  have  strong  ties  to  C,  then  it  is  likely  that  some 
form  of  tie  exists  between  A  and  B.  Strong  triadic  closure  states  that,  given  this 
mutual  connection  with  C,  a  weak  tie  between  A  and  B  must  exist.  Triadic  closure 
is  a  term  usually  applied  to  undirected  graphs.  [16]  posits  the  existence  of  an  analog 
in  a  directed  network,  or  “directed  closure.”  We  define  the  overlap  for  two  vertices. 
Formally,  the  overlap  of  a  vertex  v  with  another  vertex  u  is  equal  to  the  number  of 
directed  two-hop  paths  from  v  to  u.  In  Twitter  terms,  user  v  has  an  overlap  with  user 
u  equal  to  the  number  of  friends  of  u  that  follow  v.  We  illustrate  this  overlap  feature 
in  Figure  2-4.  The  intuition  behind  this  definition  is  based  on  the  flow  of  information. 
In Twitter, content posted by a user can be seen by his followers. Therefore, the target
could  see  some  of  the  agent’s  content  if  it  is  retweeted  by  friends  of  the  target  that 
follow  the  agent.  The  target  could  also  see  any  replies  these  users  give  to  the  agent. 
Using  our  terminology,  we  call  these  users  members  of  the  agent’s  overlap  with  the 
target.  Seeing  this  content  may  make  the  target  more  interested  in  the  agent  and 
result  in  an  increased  follow  probability. 
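
As a concrete illustration of the overlap definition, the sketch below counts the friends of a target that follow the agent. The `friends` mapping (each account mapped to the set of accounts it follows) and all account names are hypothetical, chosen only for this example.

```python
from typing import Dict, Set


def overlap(agent: str, target: str, friends: Dict[str, Set[str]]) -> int:
    """Overlap of `agent` with `target`.

    `friends[x]` is the set of accounts that x follows. The overlap is the
    number of friends of `target` that follow `agent`, i.e. the number of
    two-hop information paths agent -> w -> target.
    """
    return sum(
        1
        for w in friends.get(target, set())
        if agent in friends.get(w, set())
    )


# Hypothetical toy network: the target follows a, b, c, d;
# a, b and c follow the agent, d does not.
friends = {
    "target": {"a", "b", "c", "d"},
    "a": {"agent"},
    "b": {"agent", "a"},
    "c": {"agent"},
    "d": set(),
}

agent_overlap = overlap("agent", "target", friends)
```

In this toy network three of the target's four friends follow the agent, so `agent_overlap` is 3. The overlap is not symmetric: the target's overlap with the agent is 0 here, because the agent follows no one.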

We had the agents select targets by using the Twitter search API and also by
choosing the followers of users who had previously followed back the agent. This way
we were able to obtain targets with zero and positive overlap. The agents retweeted
and followed each target. We plot the follow conversion rate versus the overlap in
Figure 2-5. As can be seen, the conversion rate increases as the overlap increases. The
increase is not monotonic, but this may be due to not having sufficient data points
for different overlap values, as seen by the 95% confidence intervals in the figure. Our
regression model uses overlap as a feature along with the log-transformed friend and
follower counts of the target, and the results are shown in Table 2.4. As can be seen,
the overlap is a significant feature that increases the follow probability.

Another interesting result is found if we compare our measured conversion rates
with those from [22]. In that work, the conversion rate was for users who were invited
to Facebook that actually joined, and the overlap was the number of email invitations
the user received from distinct people. We plot conversion rate versus overlap from
our Twitter data and the Facebook data from [22] in Figure 2-6. We normalize our
results so that an overlap of one has conversion rate one, just as in [22]. Remarkably,
we  see  that  the  two  curves  are  almost  exactly  equal,  with  slight  differences  for  an 
overlap  of  four.  The  normalized  conversion  rate  is  the  same  for  gaining  a  follow 
in  Twitter  and  joining  Facebook.  These  are  very  different  actions  in  terms  of  effort 
required,  and  the  overlap  has  a  different  meaning  in  each  situation.  This  suggests  that 
the  overlap  feature  may  be  an  important  feature  in  a  wide  variety  of  social  networks 
and  its  impact  may  be  very  similar. 


Figure  2-4:  Illustration  of  the  overlap  feature  between  the  agent  and  target  vertex. 
The  overlap  in  the  figure  is  three. 


Figure  2-5:  Plot  of  the  follow  conversion  rate  versus  overlap.  The  red  circles  are  the 
median  and  the  error  bars  are  95%  confidence  intervals. 


Table 2.4: Overlap

Feature                        β          p-value
Intercept                      -2.49***   4.10 x 10^-8
Overlap                         0.28***   6.57 x 10^-12
log10(Target Friend Count)      0.45*     0.05
log10(Target Follower Count)   -0.63***   1.77 x 10^-4

The significance codes are *** : 0.001, ** : 0.01, and * : 0.05.

Figure 2-6: Plot of the normalized conversion rate for our data on follows in Twitter
and Facebook signups from [22] versus overlap.

Chapter 3


Theoretical  Development 

3.1  Optimal  Policies  for  the  Follow-Back  Problem 

We now study optimal policies for the follow-back problem on different graph
topologies. Recall that the goal here is to maximize the probability that a primary target
vertex follows the agent by interacting with the primary target and all vertices it
follows. The vertices followed by the primary target form the friends graph of the primary
target. Recall that we saw in Section 2.3 that the type of interaction, i.e. follow, retweet, or
reply, was insignificant in our follow probability regression, so we will consider only
one type of generic agent interaction. As a result, the interaction portion $a$ of a policy
will be the same for all policies, and we can ignore it. Furthermore, for simplicity,
we will assume that the agent’s interaction(s) with one user must conclude before
he begins interacting with another user. In other words, the agent cannot begin
interacting with one user, start communicating with another user, and switch back to
interacting with the former. In this case only the vertex sequence $\pi$ determines the
policy.

3.1.1  Follow  Probability  Model 

To solve the follow-back problem we must make some general assumptions on the
follow probabilities. Recall that earlier we assumed the follow probability depended
upon vertex features, graph state, and interaction type. Since we are assuming one
type of interaction, we do not need to consider this in the functional form of the
follow probability. Based on our empirical analysis, the only vertex features we need
to consider are the friend and follower count of the target. For a target $v$ we will
call this set of features $F_v$. However, the analysis that follows applies to any
general vertex features $F_v$. Also, as our empirical analysis showed, the graph feature
we need to include is the overlap between the agent and target. We denote the overlap
between the agent and target $v$ by $\phi_v$.

With these assumptions we can write the follow probability of a target $v$ in
response to an interaction by the agent as $p(\phi_v, F_v)$. We further assume that the follow
probability has the product form $p(\phi_v, F_v) = f(\phi_v) g(F_v)$. The product form
assumption can be viewed as an approximation to the logistic form of the follow probability
given by equation 2.1 when the probability value is small. Assume the intercept is
$\beta_0$, the coefficient of the overlap is $\beta_\phi$, and the coefficient vector of the
vertex feature set is $\beta_F$. From our logistic regression model, the follow probability
is given by

$$
p(\phi_v, F_v)
= \frac{e^{\beta_0 + \beta_\phi \phi_v + \beta_F^T F_v}}{1 + e^{\beta_0 + \beta_\phi \phi_v + \beta_F^T F_v}}
\approx e^{\beta_0 + \beta_\phi \phi_v} \, e^{\beta_F^T F_v}
= f(\phi_v) g(F_v),
$$

where, for instance, $f(\phi_v) = e^{\beta_0 + \beta_\phi \phi_v}$ and $g(F_v) = e^{\beta_F^T F_v}$.

The  above  expression  is  a  good  approximation  when  the  term  in  the  exponent  is  much 
less  than  zero,  which  is  a  valid  assumption  given  our  empirical  observations.  We  use 
this  approximation  because  it  allows  us  to  obtain  simple  analytic  solutions  for  the 
follow-back  problem  on  different  graph  topologies. 
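
The quality of the product-form approximation can be checked numerically. The sketch below is an illustration only: it uses the coefficients reported in Table 2.4, and the names `BETA_0`, `BETA_PHI`, the split of the intercept into $f$, and the example target are assumptions made for this example. Because the approximation replaces $e^x/(1+e^x)$ by $e^x$, its relative error is exactly $e^x$, which is small when the exponent $x$ is well below zero.

```python
import math

# Coefficients from Table 2.4; the symbol names are assumptions for this sketch.
BETA_0 = -2.49         # intercept
BETA_PHI = 0.28        # overlap
BETA_FRIEND = 0.45     # log10(target friend count)
BETA_FOLLOWER = -0.63  # log10(target follower count)


def exponent(overlap: int, friend_count: int, follower_count: int) -> float:
    """Linear predictor (log-odds) of the logistic model."""
    return (
        BETA_0
        + BETA_PHI * overlap
        + BETA_FRIEND * math.log10(friend_count)
        + BETA_FOLLOWER * math.log10(follower_count)
    )


def logistic_probability(overlap: int, friend_count: int, follower_count: int) -> float:
    """Follow probability under the full logistic form."""
    x = exponent(overlap, friend_count, follower_count)
    return math.exp(x) / (1.0 + math.exp(x))


def f(overlap: int) -> float:
    """Overlap factor of the product form (intercept absorbed here by assumption)."""
    return math.exp(BETA_0 + BETA_PHI * overlap)


def g(friend_count: int, follower_count: int) -> float:
    """Susceptibility factor of the product form."""
    return math.exp(
        BETA_FRIEND * math.log10(friend_count)
        + BETA_FOLLOWER * math.log10(follower_count)
    )


def product_form_probability(overlap: int, friend_count: int, follower_count: int) -> float:
    """Product-form approximation f(overlap) * g(features)."""
    return f(overlap) * g(friend_count, follower_count)


if __name__ == "__main__":
    for phi in range(0, 5):
        exact = logistic_probability(phi, 500, 2000)
        approx = product_form_probability(phi, 500, 2000)
        print(phi, round(exact, 4), round(approx, 4))
```

The approximation always overestimates the logistic probability, and the gap widens as the overlap pushes the exponent toward zero.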

Finally, we assume that the follow probabilities are monotonically increasing in
the value of the overlap, which is consistent with the overall trend in Figure 2-5. With
this assumption, it is clear that an optimal policy for the follow-back problem is for
the agent to first maximize its overlap with the primary target and then follow the
primary target. Therefore, the follow-back problem reduces to choosing a sequence of
vertices $\pi$ in the friends graph to follow which maximizes the expected number of
follows of the agent. For this reason, we do not need to consider the edges of the
primary target. In the following analysis, any assumptions on the friends graph topology
refer to the graph with the primary target edges removed.

3.1.2  Optimal  Policy  on  a  Directed  Acyclic  Graph  (DAG)

We  begin  by  analyzing  the  follow-back  problem  on  the  simplest  graph  topology,  a 
directed  acyclic  graph  (DAG).  The  analysis  is  simple  here  because  the  graph  contains 
no  cycles.  Because  the  follow  probabilities  are  increasing  in  overlap,  the  intuitive 
solution  here  would  be  to  maximize  the  overlap  for  each  interaction  by  following 
a  vertex  after  following  its  parents.  To  formalize  this,  recall  that  a  DAG  can  also 
be  viewed  as  a  partially  ordered  set  (poset)  on  the  vertices,  with  the  partial  order 
requiring  that  a  vertex  must  come  after  its  parents.  Any  sequence  of  vertices  which 
respects  this  partial  order  is  known  as  a  linear  extension.  For  DAGs,  we  have  the 
following  result,  which  matches  the  intuition. 

Theorem 3.1.1. Let $\pi^*$ be the optimal policy for the follow-back problem with initial
graph $G$ which is a directed acyclic graph. Then $\pi^*$ is a linear extension of $G$.

What  is  interesting  about  this  result  is  that  any  linear  extension  is  an  optimal 
policy.  This  is  because  all  linear  extensions  result  in  the  same  number  of  expected 
follows.  Intuitively,  this  is  true  because  the  follow  probabilities  depend  only  upon 
features  of  the  target  and  the  overlap.  Under  our  assumptions  on  the  follow  proba¬ 
bilities,  interchanging  the  order  of  vertices  in  the  policy  in  a  way  that  respects  the 
partial  order  does  not  affect  any  of  these  features.  We  illustrate  the  linear  extension 
policy  in  Figure  3-1.  Here  there  are  multiple  optimal  policies  corresponding  to  each 
linear  extension  of  the  graph.  It  does  not  matter  which  one  is  selected  because  they 
all  respect  the  order  relationships  of  the  graph. 
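
A linear extension is what is usually computed as a topological ordering. The following minimal sketch uses Python's standard `graphlib.TopologicalSorter` (Python 3.9+) on a hypothetical four-vertex friends graph; the vertex names and the `parents` mapping (each vertex mapped to the vertices it follows) are assumptions made for illustration.

```python
from graphlib import TopologicalSorter

# parents[v] = set of vertices that v follows inside the friends graph.
# Hypothetical diamond-shaped DAG: b and c follow a, and d follows b and c.
parents = {
    "a": set(),
    "b": {"a"},
    "c": {"a"},
    "d": {"b", "c"},
}


def linear_extension(parents):
    """Return one ordering in which every vertex comes after all of its parents."""
    # TopologicalSorter expects a mapping node -> predecessors, which is
    # exactly the `parents` mapping.
    return list(TopologicalSorter(parents).static_order())


def is_linear_extension(order, parents):
    """Check that `order` respects the partial order induced by the DAG."""
    position = {v: i for i, v in enumerate(order)}
    return all(position[p] < position[v] for v in parents for p in parents[v])


policy = linear_extension(parents)
```

Any ordering accepted by `is_linear_extension` is an equally good policy under the assumptions of Theorem 3.1.1; the primary target itself would then be followed after the last vertex of `policy`.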

Figure 3-1: Illustration of several optimal linear extension policies on a DAG. The
primary target is followed last for every optimal policy after the agent has interacted
with every vertex in the friends graph.

Proof. Let the friends graph be $G = (V, E)$, which is a DAG. Consider two vertices
$u, v \in V$ such that there is a directed path from $u$ to $v$ in $G$ and consequently no
path from $v$ to $u$ because $G$ is a DAG. We define two policies
$\pi = \{u_1, \ldots, u, v, \ldots, u_n\}$ and $\pi' = \{u_1, \ldots, v, u, \ldots, u_n\}$.
The only difference in the two policies is that $u$ and $v$ are swapped. In $\pi$ vertices
$u$ and $v$ respect the partial order of $G$, whereas in $\pi'$ they do not. We will now show
that the expected follows of $\pi'$ is less than that of $\pi$.

The expected follows of a policy $\pi$ can be written as
$\sum_{w \in V} \mathbf{E}[X_w \mid \pi]$, where $X_w$ is one if the agent interaction with $w$
results in a follow by $w$ and zero otherwise. We want to show that the expected follows
will be larger if $u$ is followed before $v$. For any vertex $w$ we define
$\delta_w = \mathbf{E}[X_w \mid \pi] - \mathbf{E}[X_w \mid \pi']$ and we define the difference in
the expected follows of the two policies as $\delta = \sum_{w \in V} \delta_w$. For
any vertex $w$ which occurs before $v$ and $u$ in either policy, we have that $\delta_w = 0$.
This is because the state of the graph when the agent interacts with these vertices is the
same under each policy. For vertex $u$ we have $\delta_u = 0$ because $u$ does not follow $v$
and there is no path from $v$ to $u$. Therefore, there is no way for $v$ to affect the overlap
of $u$ with the agent, and thus no way to affect the subsequent follow probability. For $v$
and all vertices after $u$ and $v$ we use the following lemma.

Lemma 3.1.2. Let $(u_1, u_2, \ldots, u_n)$ be a path on a directed graph $G$. Then
$\mathbf{E}[X_{u_j} \mid X_{u_i} = 1] > \mathbf{E}[X_{u_j} \mid X_{u_i} = 0]$ for $1 \le i < j \le n$.

This lemma shows that if the agent can get a vertex $w$ to follow it, it will receive
a boost in the expected follows for any vertex on a path from $w$. Now consider vertex
$v$. Using Lemma 3.1.2 we have


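$$
\begin{aligned}
\delta_v &= \mathbf{E}[X_v \mid X_u = 1, \pi] \, \mathbf{E}[X_u \mid \pi]
  + \mathbf{E}[X_v \mid X_u = 0, \pi] \, (1 - \mathbf{E}[X_u \mid \pi])
  - \mathbf{E}[X_v \mid X_u = 0, \pi'] \\
&= \mathbf{E}[X_u \mid \pi] \left( \mathbf{E}[X_v \mid X_u = 1, \pi] - \mathbf{E}[X_v \mid X_u = 0, \pi] \right) \\
&> 0.
\end{aligned}
$$

Above we have used Lemma 3.1.2 along with the fact that the $X_i$ are Bernoulli random
variables, so $\mathbf{E}[X_i] = \mathbf{P}(X_i = 1)$. We also made use of the fact that
$\mathbf{E}[X_v \mid X_u = 0, \pi] = \mathbf{E}[X_v \mid X_u = 0, \pi']$ because if $u$ does not
follow the agent, the follow probability is the same under both policies when the agent
interacts with $v$.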

For any vertex $w$ that occurs after $u$ and $v$ we have two situations. Either there
is no path from $v$ to $w$, in which case $\delta_w = 0$ because $v$ cannot affect the follow
probability of $w$, or there is such a path. In the case where a path exists, we have

$$
\begin{aligned}
\delta_w &= \mathbf{E}[X_w \mid X_v = 1, \pi] \, \mathbf{E}[X_v \mid \pi]
  + \mathbf{E}[X_w \mid X_v = 0, \pi] \, (1 - \mathbf{E}[X_v \mid \pi]) \\
&\quad - \mathbf{E}[X_w \mid X_v = 1, \pi'] \, \mathbf{E}[X_v \mid \pi']
  - \mathbf{E}[X_w \mid X_v = 0, \pi'] \, (1 - \mathbf{E}[X_v \mid \pi']) \\
&= \mathbf{E}[X_w \mid X_v = 1] \left( \mathbf{E}[X_v \mid \pi] - \mathbf{E}[X_v \mid \pi'] \right)
  + \mathbf{E}[X_w \mid X_v = 0] \left( \mathbf{E}[X_v \mid \pi'] - \mathbf{E}[X_v \mid \pi] \right) \\
&= \left( \mathbf{E}[X_v \mid \pi] - \mathbf{E}[X_v \mid \pi'] \right)
  \left( \mathbf{E}[X_w \mid X_v = 1] - \mathbf{E}[X_w \mid X_v = 0] \right) \\
&= \delta_v \left( \mathbf{E}[X_w \mid X_v = 1] - \mathbf{E}[X_w \mid X_v = 0] \right) \\
&> 0.
\end{aligned}
$$

Here, in addition to our result for $\delta_v$, we also used the fact that the expected value
of $X_w$ conditioned on $X_v$ is the same for both policies because the relevant graph
state is the same. This result shows that $\delta > 0$, and $\pi$ has more expected follows
than $\pi'$. Therefore, any policy can increase its expected follows by swapping any pair
of adjacent vertices so they respect the partial order of the underlying DAG. This
process can be continued until the policy is a linear extension of the DAG and no
further increase in the expected follows is possible. □
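
The swap argument can be checked by brute force on a small example. The sketch below computes the exact expected number of follows for every ordering of a hypothetical four-vertex DAG by enumerating all follow/no-follow outcomes, assuming unit susceptibilities and an illustrative increasing overlap function `f`; none of these numerical values come from the data.

```python
from itertools import permutations

# parents[v] = set of vertices that v follows (hypothetical diamond DAG).
PARENTS = {"a": set(), "b": {"a"}, "c": {"a"}, "d": {"b", "c"}}


def f(overlap):
    """Assumed overlap function: strictly increasing, illustrative values only."""
    return 0.1 + 0.1 * overlap


def expected_follows(order, parents, f):
    """Exact expected follows when the agent interacts with vertices in `order`.

    Susceptibilities are taken to be one. The recursion branches on whether
    each vertex follows the agent, so it enumerates every outcome.
    """
    def recurse(i, followed):
        if i == len(order):
            return 0.0
        v = order[i]
        overlap = len(parents[v] & followed)  # parents of v already following the agent
        p = f(overlap)
        return (p * (1.0 + recurse(i + 1, followed | {v}))
                + (1.0 - p) * recurse(i + 1, followed))

    return recurse(0, frozenset())


def is_linear_extension(order, parents):
    """True if every vertex appears after all of its parents."""
    position = {v: i for i, v in enumerate(order)}
    return all(position[p] < position[v] for v in parents for p in parents[v])


values = {order: expected_follows(order, PARENTS, f) for order in permutations(PARENTS)}
best = max(values.values())
optimal_orders = {order for order, value in values.items() if best - value < 1e-12}
```

On this example the orderings that attain the maximum are exactly the linear extensions of the DAG, and all of them attain the same value, as Theorem 3.1.1 predicts.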

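Proof of Lemma 3.1.2. We will prove the result by induction. For any $1 \le i < n$ we
have that $\mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 1] > \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 0]$
because the follow probabilities are monotonically increasing in the overlap and $u_{i+1}$
follows $u_i$. This gives our base case of
$\mathbf{E}[X_{u_2} \mid X_{u_1} = 1] > \mathbf{E}[X_{u_2} \mid X_{u_1} = 0]$. We then assume that
$\mathbf{E}[X_{u_i} \mid X_{u_1} = 1] > \mathbf{E}[X_{u_i} \mid X_{u_1} = 0]$ for $2 \le i < n$.
For the case of $i + 1$ we then have that for $k \in \{0, 1\}$

$$
\begin{aligned}
\mathbf{E}[X_{u_{i+1}} \mid X_{u_1} = k]
&= \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 1] \, \mathbf{E}[X_{u_i} \mid X_{u_1} = k]
 + \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 0] \, (1 - \mathbf{E}[X_{u_i} \mid X_{u_1} = k]) \\
&= \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 0]
 + \mathbf{E}[X_{u_i} \mid X_{u_1} = k]
   \left( \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 1] - \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 0] \right).
\end{aligned}
$$

Now we define $\delta = \mathbf{E}[X_{u_{i+1}} \mid X_{u_1} = 1] - \mathbf{E}[X_{u_{i+1}} \mid X_{u_1} = 0]$.
Using the above result we have

$$
\begin{aligned}
\delta &= \left( \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 1] - \mathbf{E}[X_{u_{i+1}} \mid X_{u_i} = 0] \right)
          \left( \mathbf{E}[X_{u_i} \mid X_{u_1} = 1] - \mathbf{E}[X_{u_i} \mid X_{u_1} = 0] \right) \\
&> 0.
\end{aligned}
$$

Above we used the induction hypothesis and the monotonicity of the follow
probabilities in the overlap. □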


3.1.3  Expected  Follows  on  an  Arborescence 

We know that the optimal policy on a DAG is a linear extension. We look at the
expected follows of the optimal policy on a subclass of DAGs known as arborescences.
An arborescence is a DAG with a single root vertex, where all vertices have at most
one parent. In this case each vertex follows at most one other vertex, so the overlap of a
vertex with the agent is either zero or one. Under this setting we can provide a closed
form expression for the expected follows of the optimal policy.

We assume that the follow probability for a target vertex $v$ with feature set $F_v$
and overlap $\phi_v$ is given by $p(\phi_v, F_v) = f(\phi_v) g(F_v)$. Let $\Delta = f(1) - f(0)$
and for each vertex $v$ define its susceptibility as $g(F_v)$. Also, let $\mathcal{P}_l(v, G)$ be
the set of cycle free directed paths of length $l$ in a graph $G$ which terminate on vertex $v$.
For instance, if $v$ is the root vertex of an arborescence $G$, then $\mathcal{P}_0(v, G) = \{v\}$,
and if $u$ is a child of the root, then $\mathcal{P}_0(u, G) = \{u\}$ and
$\mathcal{P}_1(u, G) = \{(v, u)\}$. We have the following result for the expected follows of a
linear extension policy (which is optimal) on an arborescence.


Theorem 3.1.3. Let $G = (V, E)$ be an arborescence with $N$ vertices. Let $J(G)$ be
the expected follows of a linear extension policy on $G$. Then

$$
J(G) = f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{v \in V} \sum_{p \in \mathcal{P}_l(v, G)} \prod_{u \in p} g(F_u). \tag{3.1}
$$

This expression weighs each possible follow path of length $l$ by $\Delta^l$ and the product
of susceptibilities of vertices on the path. While this expression gives the exact number
of expected follows on an arborescence where vertices have different susceptibilities,
it can be greatly simplified if we assume all susceptibilities are uniform. For an
arborescence, each vertex can be classified by its distance to the root. Let $Z_i$ be the
number of vertices a distance $i$ from the root. Then we have the following result.


Lemma 3.1.4. Let $G = (V, E)$ be an arborescence with $N$ vertices. Assume that for
all vertices $v \in V$, $g(F_v) = 1$. Let $J(G)$ be the expected follows of a linear extension
policy on $G$. Then

$$
J(G) = \frac{f(0) N}{1 - \Delta} - \frac{f(0) \Delta}{1 - \Delta} \sum_{i=0}^{N-1} Z_i \Delta^i. \tag{3.2}
$$

In the limiting case where $\Delta = 0$, $J(G) = f(0) N$. This means that when there is
no overlap boost, the expected follows is just the number of vertices multiplied by the
constant follow probability $f(0)$. This formula also provides insights on what graph
topologies are more conducive to follows. Consider two simple graphs. One is a path
graph on $N$ vertices, which we will refer to as $P_N$, and the other is a star graph with
a single root with $N - 1$ children, which we will refer to as $S_N$. For $P_N$, $Z_i = 1$ for
$0 \le i \le N - 1$, and for $S_N$, $Z_0 = 1$ and $Z_1 = N - 1$. Using equation (3.2) we find
that

$$
\begin{aligned}
J(P_N) &= \frac{f(0) N}{1 - \Delta} - \frac{f(0) \Delta (1 - \Delta^N)}{(1 - \Delta)^2}, \\
J(S_N) &= \frac{f(0) N}{1 - \Delta} - \frac{\Delta f(0) \left( 1 + \Delta (N - 1) \right)}{1 - \Delta}.
\end{aligned}
$$

Taking the difference of these two expressions we find that

$$
J(P_N) - J(S_N) = \frac{\Delta f(0)}{1 - \Delta} \left( 1 + \Delta (N - 1) - \frac{1 - \Delta^N}{1 - \Delta} \right).
$$

Because $0 < \Delta < 1$, we have $(1 - \Delta^N)/(1 - \Delta) = 1 + \Delta + \cdots + \Delta^{N-1} < 1 + \Delta (N - 1)$
for $N \ge 3$, so this difference is positive and a path topology is better for
getting follows than a star. This is expected because a path allows the agent multiple
opportunities to gain overlap, whereas in the star, if the root does not follow, there
will be no overlap for subsequent interactions. This analysis gives us the insight that
in general, DAGs with longer paths produce more follows.
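
The path versus star comparison can be reproduced numerically. The sketch below evaluates the closed form of Lemma 3.1.4 from the level counts $Z_i$ and cross-checks it against the direct recursion $\mathbf{E}[X_{\text{child}}] = f(0) + \Delta \, \mathbf{E}[X_{\text{parent}}]$ that holds with unit susceptibilities. The values `f0 = 0.05`, `delta = 0.3`, and `N = 6` are arbitrary assumptions for illustration.

```python
def closed_form(levels, f0, delta):
    """Closed form of Lemma 3.1.4; levels[i] = Z_i, the number of vertices at distance i."""
    n = sum(levels)
    tail = sum(z * delta**i for i, z in enumerate(levels))
    return f0 * n / (1.0 - delta) - f0 * delta / (1.0 - delta) * tail


def by_recursion(levels, f0, delta):
    """Same quantity from E[X_child] = f0 + delta * E[X_parent], unit susceptibilities."""
    total = 0.0
    e_level = 0.0
    for i, z in enumerate(levels):
        # Every vertex at distance i has the same expected follow indicator.
        e_level = f0 if i == 0 else f0 + delta * e_level
        total += z * e_level
    return total


N, f0, delta = 6, 0.05, 0.3
path_levels = [1] * N      # P_N: one vertex at each distance 0..N-1
star_levels = [1, N - 1]   # S_N: a root with N-1 children

j_path = closed_form(path_levels, f0, delta)
j_star = closed_form(star_levels, f0, delta)
```

With these values `j_path` exceeds `j_star`, and the gap matches the difference expression derived above.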


Proof of Theorem 3.1.3. We make use of the following lemma.

Lemma 3.1.5. Let $G = (V, E)$ be an arborescence rooted at $u_1$. Consider the path
$(u_1, u_2, \ldots, u_i)$ in $G$ and assume the agent interacts with the vertices in sequence along
this path. Let $X_i$ be the random variable which is one if $u_i$ follows as a result of the
agent interaction sequence and zero otherwise. Then

$$
\mathbf{E}[X_i] = f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{p \in \mathcal{P}_l(u_i, G)} \prod_{u \in p} g(F_u). \tag{3.3}
$$

With this lemma we can easily calculate the expected follows of a linear extension
policy on $G$. Note that because $G$ is an arborescence, each vertex has a unique path
from the root vertex. Therefore, we can directly apply Lemma 3.1.5 to each vertex
using this path and then sum over all vertices. Doing so we obtain

$$
\begin{aligned}
J(G) &= \sum_{v \in V} \mathbf{E}[X_v] \\
&= f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{v \in V} \sum_{p \in \mathcal{P}_l(v, G)} \prod_{u \in p} g(F_u).
\end{aligned}
$$

□
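
The path-sum formula for arborescences with heterogeneous susceptibilities can be cross-checked against the one-step recursion $\mathbf{E}[X_v] = g(F_v)\,(f(0) + \Delta\, \mathbf{E}[X_{\text{parent}(v)}])$. The sketch below does this on a small hypothetical arborescence; the tree, the susceptibility values `g`, and the parameters `f0` and `delta` are assumptions chosen only for illustration.

```python
def expected_follows_recursive(parent, g, f0, delta):
    """Sum over v of E[X_v], with E[X_v] = g[v] * (f0 + delta * E[X_parent(v)]).

    `parent[v]` is the single vertex that v follows, or None for the root.
    """
    memo = {}

    def e(v):
        if v not in memo:
            boost = 0.0 if parent[v] is None else delta * e(parent[v])
            memo[v] = g[v] * (f0 + boost)
        return memo[v]

    return sum(e(v) for v in parent)


def expected_follows_path_sum(parent, g, f0, delta):
    """Path-sum form: every directed path ending at v contributes
    f0 * delta**length * (product of susceptibilities along the path)."""
    total = 0.0
    for v in parent:
        weight, length, u = 1.0, 0, v
        while u is not None:
            # Extend the path ending at v by one more ancestor.
            weight *= g[u]
            total += f0 * delta**length * weight
            length += 1
            u = parent[u]
    return total


# Hypothetical arborescence: r is the root, a and b follow r, c follows a.
parent = {"r": None, "a": "r", "b": "r", "c": "a"}
g = {"r": 1.2, "a": 0.8, "b": 1.0, "c": 1.5}

j_recursive = expected_follows_recursive(parent, g, f0=0.05, delta=0.3)
j_path_sum = expected_follows_path_sum(parent, g, f0=0.05, delta=0.3)
```

Both functions return the same value, since in an arborescence each vertex has exactly one path of each admissible length ending at it.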


Proof of Lemma 3.1.4. If we set the susceptibilities $g(F_u)$ to one for all vertices, the
expected follows reduces to

$$
J(G) = f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{v \in V} |\mathcal{P}_l(v, G)|. \tag{3.4}
$$

We need to count the number of vertices that are the final vertex in a directed path
of length $l$ for $0 \le l \le N - 1$. This can be easily done by noting that the only
vertices that end a path of length $l$ must be at a distance of $l$ or greater from the
root. That is, there are $\sum_{i=l}^{N-1} Z_i$ vertices that terminate a path of length $l$. Using
this, the expected follows becomes

$$
\begin{aligned}
J(G) &= f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{i=l}^{N-1} Z_i \\
&= f(0) \sum_{i=0}^{N-1} Z_i \sum_{l=0}^{i} \Delta^l \\
&= f(0) \sum_{i=0}^{N-1} Z_i \, \frac{1 - \Delta^{i+1}}{1 - \Delta} \\
&= \frac{f(0) N}{1 - \Delta} - \frac{f(0) \Delta}{1 - \Delta} \sum_{i=0}^{N-1} Z_i \Delta^i.
\end{aligned}
$$

Above we have used the fact that $\sum_{i=0}^{N-1} Z_i = N$.

□


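Proof of Lemma 3.1.5. We prove the result by induction. To simplify notation, let the
feature set of $u_i$ simply be $F_i$. For $i = 1$, we have $\mathbf{E}[X_1] = f(0) g(F_1)$. The induction
hypothesis is
$\mathbf{E}[X_i] = f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{p \in \mathcal{P}_l(u_i, G)} \prod_{u \in p} g(F_u)$.
For $i + 1$ we have

$$
\begin{aligned}
\mathbf{E}[X_{i+1}] &= \mathbf{E}[X_{i+1} \mid X_i = 1] \, \mathbf{E}[X_i]
  + \mathbf{E}[X_{i+1} \mid X_i = 0] \, (1 - \mathbf{E}[X_i]) \\
&= \mathbf{E}[X_i] \left( \mathbf{E}[X_{i+1} \mid X_i = 1] - \mathbf{E}[X_{i+1} \mid X_i = 0] \right)
  + \mathbf{E}[X_{i+1} \mid X_i = 0] \\
&= \Delta \, g(F_{i+1}) f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{p \in \mathcal{P}_l(u_i, G)} \prod_{u \in p} g(F_u)
  + f(0) g(F_{i+1}) \\
&= f(0) \sum_{l=0}^{N-1} \Delta^l \sum_{p \in \mathcal{P}_l(u_{i+1}, G)} \prod_{u \in p} g(F_u).
\end{aligned}
$$

□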


3.1.4  Optimal  Policy  on  a  Graph  with  a  Single  Cycle 

We now consider a graph with $N$ vertices which has a single directed cycle on $n$ vertices
and assume that the vertex susceptibilities are all equal to one. The vertices in the
cycle are $\{u_1, u_2, \ldots, u_n\}$ and the edges of the cycle are
$\{(u_1, u_2), (u_2, u_3), \ldots, (u_{n-1}, u_n), (u_n, u_1)\}$.
We also assume each cycle vertex $u_i$ is followed by $D_i$ vertices who follow no one else.
We refer to this set of vertices as $F_i$ for cycle vertex $u_i$. An illustration of this graph
structure is shown in Figure 3-2. This graph is not a DAG so a linear extension
cannot be calculated. However, there are $n$ possible DAGs, each one determined by
which cycle node is the root. The optimal policy will then be determined by one of
these DAGs.

Figure 3-2: Example of a directed graph with a single cycle for $n = 4$.

To determine the optimal DAG we use the following notation. Let $\pi_i$ be the cyclic
shift of the sequence $\{1, 2, \ldots, n\}$ that has $i$ as the first element. Each DAG of the
graph is associated with one of the $\pi_i$ with $u_i$ as the root, and therefore, the optimal
policy will correspond to one of the $\pi_i$. Denote the index of the optimal policy by $i^*$.
The following lemma characterizes the optimal policy.

Lemma 3.1.6. Assume $g(F_v) = 1$ for all $v \in V$. Then the index of the optimal policy
for the follow-back problem with initial graph which contains a single $n$ vertex cycle
is

$$
i^* = \arg\min_{1 \le i \le n} \sum_{l=1}^{n} \Delta^l D_{\pi_i(l)}. \tag{3.5}
$$

Recall that $\Delta < 1$. We see from this lemma that what the optimal policy does
is select a DAG such that the vertices closer to the root have a lower number of
followers. This makes intuitive sense because one would prefer to hold off on a cycle
vertex with many followers: before attempting to connect with these followers,
one would like to maximize the overlap with the parent vertex. As an example, for
the graph in Figure 3-2, for $0 < \Delta < 1$ the optimal policy corresponds to the DAG
rooted at vertex $u_3$, which also has the lowest number of followers among the cycle
vertices.
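
The root selection rule of Lemma 3.1.6 is simple to evaluate. The sketch below computes the weighted sum $\sum_{l} \Delta^l D_{\pi_i(l)}$ for every cyclic shift and compares the minimizer with a direct computation of the expected follows. The follower counts `D` and the values of `f0` and `delta` are hypothetical and are not read from Figure 3-2; indices are 0-based in the code.

```python
def rotation_cost(D, i, delta):
    """Weighted sum over l = 1..n of delta**l * D[pi_i(l)], where pi_i is the
    cyclic shift of the cycle vertices that starts at index i (0-based)."""
    n = len(D)
    return sum(delta**l * D[(i + l - 1) % n] for l in range(1, n + 1))


def optimal_root(D, delta):
    """Index of the cycle vertex to use as root according to Lemma 3.1.6."""
    return min(range(len(D)), key=lambda i: rotation_cost(D, i, delta))


def expected_follows(D, i, f0, delta):
    """Expected follows of the arborescence rooted at cycle vertex i,
    assuming unit susceptibilities and f(1) - f(0) = delta."""
    n = len(D)
    total, e_cycle = 0.0, 0.0
    for k in range(n):
        d = D[(i + k) % n]
        # Cycle vertex at distance k from the root.
        e_cycle = f0 if k == 0 else f0 + delta * e_cycle
        # Add the cycle vertex and its d followers, each with overlap boost delta * e_cycle.
        total += e_cycle + d * (f0 + delta * e_cycle)
    return total


D = [2, 3, 0, 1]   # hypothetical follower counts D_1..D_4 of the cycle vertices
best = optimal_root(D, delta=0.5)
```

For these counts the rule selects index 2, i.e. the third cycle vertex, which has no followers of its own, and this is also the root that maximizes `expected_follows`.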


Proof of Lemma 3.1.6. Let $G_i$ be the DAG rooted at $u_i$. This DAG is also an arborescence,
so we can use equation (3.2) to calculate the expected follows. The sequence of
degrees of the cyclic vertices in $G_i$ is given by
$\{D_{\pi_i(1)}, D_{\pi_i(2)}, \ldots, D_{\pi_i(n)}\}$. This degree sequence is what will determine
the optimal DAG. For $G_i$, we have that $Z_0 = 1$, $Z_n = D_{\pi_i(n)}$, and
$Z_l = 1 + D_{\pi_i(l)}$ for $1 \le l < n$. Applying equation (3.2) we get

$$
J(G_i) = \frac{f(0) N}{1 - \Delta} - \frac{f(0) \Delta}{1 - \Delta}
\left( 1 + \sum_{j=1}^{n-1} \Delta^j + \sum_{l=1}^{n} \Delta^l D_{\pi_i(l)} \right).
$$

To maximize this expression one selects the index $i$ that minimizes
$\sum_{l=1}^{n} \Delta^l D_{\pi_i(l)}$. □


3.1.5  Expected  Follows  on  a  DAG 

We know that the optimal policy on a DAG is a linear extension. A natural question
to ask is what is the expected number of follows the optimal policy achieves on a DAG?
Here we provide a closed form expression for the expected follows of the optimal policy,
under an additional assumption on the overlap function $f(\phi_v)$. We assume that the
follow probability for a target vertex $v$ with feature set $F_v$ and overlap $\phi_v$ is given by
$p(\phi_v, F_v) = f(\phi_v) g(F_v)$. To obtain a simple closed form expression, we assume that
$f(\phi_v)$ is monotonically increasing and linear, i.e. $f(\phi_v) = \alpha + \beta \phi_v$ for some
$\alpha, \beta > 0$. For each vertex $v$, we define its susceptibility as $g(F_v)$. Also, let
$\mathcal{P}_l(v, G)$ be the set of cycle free directed paths of length $l$ in a graph $G$ which
terminate on vertex $v$. For instance, if $v$ is a root vertex of a DAG $G$, then
$\mathcal{P}_0(v, G) = \{v\}$, and if $u$ is a child of $v$, then $\mathcal{P}_0(u, G) = \{u\}$ and
$\mathcal{P}_1(u, G) = \{(v, u)\}$. We have the following result for the
expected follows of a linear extension policy (which is optimal) on a DAG.


Theorem 3.1.7. Let $G = (V, E)$ be a DAG with $N$ vertices. Let the follow probability
for a target vertex $v$ with feature set $F_v$ and overlap $\phi_v$ be given by $p(\phi_v, F_v) =
f(\phi_v) g(F_v)$ and let $f(\phi_v) = \alpha + \beta\phi_v$ for some $\alpha, \beta > 0$. Let $J(G)$ be the expected
follows of a linear extension policy on $G$. Then

$$J(G) = \alpha \sum_{l=0}^{N-1} \beta^l \sum_{v \in V} \sum_{T \in \mathcal{P}_l(v,G)} w_T \qquad (3.6)$$

where for a path $T$ we define

$$w_T = \prod_{u \in T} g(F_u). \qquad (3.7)$$
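As a small worked illustration of (3.6), consider a hypothetical two-vertex DAG with a single edge from $v$ to $u$ (this example is not from the thesis). The paths are the two single vertices and the one edge, so

$$\mathcal{P}_0(v,G) = \{v\}, \quad \mathcal{P}_0(u,G) = \{u\}, \quad \mathcal{P}_1(u,G) = \{(v,u)\},$$

$$J(G) = \alpha\big(g(F_v) + g(F_u)\big) + \alpha\beta\, g(F_v)\, g(F_u).$$

This agrees with a direct calculation: the root follows with probability $\alpha g(F_v)$, and $u$ follows with probability $g(F_u)\big(\alpha + \beta\,\alpha g(F_v)\big)$, since its expected overlap is the probability that $v$ followed.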

3.1.6  Proof  of  Theorem  3.1.7 

We  make  use  of  the  following  lemma. 

Lemma 3.1.8. Let $G = (V, E)$ be a DAG with $N$ vertices. Let the follow probability
for a target vertex $v$ with feature set $F_v$ and overlap $\phi_v$ be given by $p(\phi_v, F_v) =
f(\phi_v) g(F_v)$ and let $f(\phi_v) = \alpha + \beta\phi_v$ for some $\alpha, \beta > 0$. Assume the agent interacts
with the parents of a vertex $v$ before it interacts with $v$. Let $X_v$ be the random variable
which is one if $v$ follows as a result of the agent interaction sequence and zero
otherwise. Also, let $P(v)$ denote the set of parent vertices of $v$. Then

$$E[X_v] = \alpha \sum_{l=0}^{N-1} \beta^l \sum_{p \in \mathcal{P}_l(v,G)} \prod_{u \in p} g(F_u). \qquad (3.8)$$

With this lemma we can easily calculate the expected follows of a linear extension
policy on $G$ by applying Lemma 3.1.8 to each vertex and then summing
over all vertices. Doing so we obtain

$$J(G) = \sum_{v \in V} E[X_v] = \alpha \sum_{l=0}^{N-1} \beta^l \sum_{v \in V} \sum_{p \in \mathcal{P}_l(v,G)} \prod_{u \in p} g(F_u).$$

3.1.7  Proof  of  Lemma  3.1.8 

We  require  the  following  result. 

Lemma 3.1.9. Consider a vertex $v$ in a DAG $G = (V, E)$ with parents given by the
set $P(v) \subset V$, $|P(v)| = n$. Let the susceptibility and overlap functions be $g$ and $f$,
respectively. Assume the vertices are interacted with according to a linear extension
policy $\pi$. Let $q_v = E[X_v \mid \pi]$. Then we have that

$$q_v = g(F_v) \sum_{k=0}^{n} A_k \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u \qquad (3.9)$$

where we define $A_k = \sum_{i=0}^{k} a_i^k f(i)$, where $a_i^k = a_{i-1}^{k-1} - a_i^{k-1}$, $a_{k+1}^{k+1} = a_k^k = 1$,
$a_0^k = (-1)^k$, and $a_{-1}^k = 0$.

If we assume that $f(\phi) = \alpha + \beta\phi$, then equation (3.9) takes a simpler form. We
use the following result regarding the terms $a_i^k$.


Lemma 3.1.10. Let the terms $a_i^k$ be given as in Lemma 3.1.9. Then we have that
$\sum_{i=0}^{k} a_i^k = 0$ for $k > 0$, $\sum_{i=0}^{1} a_i^1 i = 1$, and $\sum_{i=0}^{k} a_i^k i = 0$ for $k > 1$.


For linear $f$, the terms $A_k$ take the following form.

$$A_k = \alpha \sum_{i=0}^{k} a_i^k + \beta \sum_{i=0}^{k} a_i^k i$$

Using Lemma 3.1.10 we find that

$$A_0 = \alpha, \qquad A_1 = \beta, \qquad A_k = 0, \quad k > 1.$$
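The collapse of the coefficients $A_k$ for a linear overlap function can be checked numerically. The sketch below builds the terms $a_i^k$ from the recurrence and boundary values quoted in Lemma 3.1.9 and evaluates $A_k$ for a linear $f$. The function names `coeff` and `A`, and the convention that out-of-range terms are zero, are assumptions made for this illustration.

```python
from fractions import Fraction
from functools import lru_cache


@lru_cache(maxsize=None)
def coeff(k: int, i: int) -> int:
    """Return a_i^k using a_i^k = a_{i-1}^{k-1} - a_i^{k-1}."""
    if i < 0 or i > k:
        return 0              # assumed convention: out-of-range terms vanish
    if i == k:
        return 1              # a_k^k = 1
    if i == 0:
        return (-1) ** k      # a_0^k = (-1)^k
    return coeff(k - 1, i - 1) - coeff(k - 1, i)


def A(k, f):
    """A_k = sum over i = 0..k of a_i^k * f(i)."""
    return sum(coeff(k, i) * f(i) for i in range(k + 1))


# Hypothetical linear overlap function f(phi) = alpha + beta * phi.
alpha, beta = Fraction(1, 20), Fraction(29, 100)
linear_f = lambda phi: alpha + beta * phi
values = [A(k, linear_f) for k in range(6)]
```

Exact rational arithmetic is used so that the higher order coefficients come out as exactly zero rather than as floating point noise.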


Substituting this into equation (3.9) we obtain

$$q_v = g(F_v) \Big( \alpha + \beta \sum_{u \in P(v)} q_u \Big). \qquad (3.10)$$
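This expression is a recursion in the depth of the DAG. If we expand it we obtain

$$
\begin{aligned}
q_v &= g(F_v) \Big( \alpha + \beta \sum_{u \in P(v)} g(F_u) \Big( \alpha + \beta \sum_{w \in P(u)} q_w \Big) \Big) \\
&= \alpha g(F_v) + \alpha\beta \sum_{u \in P(v)} g(F_u) g(F_v) + \beta^2 \sum_{u \in P(v)} \sum_{w \in P(u)} q_w g(F_u) g(F_v) \\
&= \alpha g(F_v) + \alpha\beta \sum_{u \in P(v)} g(F_u) g(F_v) + \beta^2 \sum_{u \in P(v)} \sum_{t \in P(u)} g(F_t) g(F_u) g(F_v) \Big( \alpha + \beta \sum_{s \in P(t)} q_s \Big) \\
&= \alpha g(F_v) + \alpha\beta \sum_{u \in P(v)} g(F_u) g(F_v) + \alpha\beta^2 \sum_{u \in P(v)} \sum_{t \in P(u)} g(F_t) g(F_u) g(F_v) \\
&\qquad + \beta^3 \sum_{u \in P(v)} \sum_{t \in P(u)} \sum_{s \in P(t)} q_s g(F_t) g(F_u) g(F_v) \\
&= \alpha \Big( \sum_{(v) \in \mathcal{P}_0(v,G)} g(F_v) + \beta \sum_{(u,v) \in \mathcal{P}_1(v,G)} g(F_u) g(F_v) + \beta^2 \sum_{(t,u,v) \in \mathcal{P}_2(v,G)} g(F_t) g(F_u) g(F_v) \Big) \\
&\qquad + \beta^3 \sum_{(s,t,u,v) \in \mathcal{P}_3(v,G)} q_s g(F_t) g(F_u) g(F_v).
\end{aligned}
$$

If we continue this expansion higher up the DAG, then this pattern continues until we
hit the roots, at which point there are no more parents and the expansion terminates,
since for a root $r$, we have trivially that $q_r = \alpha g(F_r)$. What is happening is then
evident. The expression for $q_v$ is summing over all paths in the DAG which terminate
on $v$. Each such path $p$ containing $l$ vertices is weighted by $\alpha\beta^{l-1} \prod_{u \in p} g(F_u)$. Using
this, we obtain

$$q_v = \alpha \sum_{l=0}^{N-1} \beta^l \sum_{p \in \mathcal{P}_l(v,G)} \prod_{u \in p} g(F_u). \qquad (3.11)$$

The total follows on the DAG is given by

$$J(G) = \sum_{v \in V} q_v = \alpha \sum_{l=0}^{N-1} \beta^l \sum_{v \in V} \sum_{p \in \mathcal{P}_l(v,G)} \prod_{u \in p} g(F_u).$$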
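The equivalence between the recursion (3.10) and the path sum (3.6) can be checked on a small example. The following sketch computes the expected follows both ways; the graph, the susceptibility values, and all function names are hypothetical choices made for illustration, and the recursive path enumeration is only practical for small DAGs.

```python
from math import prod


def follow_probs_recursive(parents, g, order, alpha, beta):
    """q_v = g_v * (alpha + beta * sum of q_u over parents u), as in (3.10)."""
    q = {}
    for v in order:  # `order` must list every parent before its children
        q[v] = g[v] * (alpha + beta * sum(q[u] for u in parents[v]))
    return q


def paths_ending_at(parents, v):
    """Yield every directed path (tuple of vertices) that terminates on v."""
    yield (v,)
    for u in parents[v]:
        for p in paths_ending_at(parents, u):
            yield p + (v,)


def expected_follows_path_sum(parents, g, alpha, beta):
    """J(G) as in (3.6): a path with l edges contributes alpha * beta**l * w_T."""
    total = 0.0
    for v in parents:
        for path in paths_ending_at(parents, v):
            total += alpha * beta ** (len(path) - 1) * prod(g[u] for u in path)
    return total


# Hypothetical diamond DAG: a -> b, a -> c, b -> d, c -> d.
parents = {"a": [], "b": ["a"], "c": ["a"], "d": ["b", "c"]}
g = {"a": 0.5, "b": 0.4, "c": 0.3, "d": 0.2}
q = follow_probs_recursive(parents, g, ["a", "b", "c", "d"], alpha=0.1, beta=0.3)
j_recursive = sum(q.values())
j_paths = expected_follows_path_sum(parents, g, alpha=0.1, beta=0.3)
```

The recursion visits each edge once, whereas the path sum can grow quickly with graph density, which is one reason the recursion is the more convenient form for computation.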
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3.1.8  Proof  of  Lemma  3.1.9 


We prove the result by induction on the number of parents $n$ of $v$. The induction
hypothesis is given by equation (3.9). For $n = 0$, we have $q_v = f(0) g(F_v)$, which
satisfies the induction hypothesis. For $n = 1$, let the parent be $u$. From the definition
in Lemma 3.1.9 we have that $A_0 = f(0)$ and $A_1 = f(1) - f(0)$. Then we have

$$
\begin{aligned}
q_v &= g(F_v) \big( f(0)(1 - q_u) + f(1) q_u \big) \\
&= g(F_v) \big( f(0) + q_u (f(1) - f(0)) \big) \\
&= g(F_v) \big( A_0 + A_1 q_u \big)
\end{aligned}
$$

which also satisfies the induction hypothesis. For the $n + 1$ case, let the additional
vertex added be denoted $n + 1$. Conditional on this vertex not following, $q_v$ is not
changed. Conditional on this vertex following, the overlap increases by one. The
value of $q_v$, conditioned on this event, is given by equation (3.9), except that the
argument of $f$ is increased by one in the definition of $A_k$. To ease our notation, let
us define $A'_k = \sum_{i=0}^{k} a_i^k f(i + 1)$. We let $P(v)$ be the parents of $v$ not including $n + 1$.
Then we have

$$
\begin{aligned}
q_v &= E[X_v \mid X_{n+1} = 0] (1 - q_{n+1}) + E[X_v \mid X_{n+1} = 1] q_{n+1} \\
&= (1 - q_{n+1}) g(F_v) \sum_{k=0}^{n} A_k \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u
 + q_{n+1} g(F_v) \sum_{k=0}^{n} A'_k \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u \\
&= g(F_v) \sum_{k=0}^{n} A_k \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u
 + q_{n+1} g(F_v) \sum_{k=0}^{n} (A'_k - A_k) \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u. \qquad (3.12)
\end{aligned}
$$

The difference $A'_k - A_k$ is given by

$$
\begin{aligned}
A'_k - A_k &= \sum_{i=0}^{k} a_i^k \big( f(i + 1) - f(i) \big) \\
&= a_k^k f(k + 1) - a_k^k f(k) + a_{k-1}^k f(k) - a_{k-1}^k f(k - 1) + \ldots + a_0^k f(1) - a_0^k f(0) \\
&= f(k + 1)(a_k^k - a_{k+1}^k) + f(k)(a_{k-1}^k - a_k^k) + \ldots + f(0)(a_{-1}^k - a_0^k) \\
&= \sum_{i=0}^{k+1} a_i^{k+1} f(i) \\
&= A_{k+1}.
\end{aligned}
$$

Above we used the definition of $a_i^k$ given in Lemma 3.1.9, together with the convention
$a_{k+1}^k = 0$. Substituting this into equation (3.12) and letting $P'(v)$ denote the parents
of $v$ including $n + 1$, we obtain

$$
\begin{aligned}
q_v &= g(F_v) \sum_{k=0}^{n} A_k \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u
 + q_{n+1} g(F_v) \sum_{k=0}^{n} A_{k+1} \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u \\
&= g(F_v) \Big( \sum_{k=0}^{n} A_k \sum_{S \subseteq P(v): |S| = k} \prod_{u \in S} q_u
 + \sum_{k=1}^{n+1} A_k \sum_{S \subseteq P'(v): |S| = k,\ n+1 \in S} \prod_{u \in S} q_u \Big) \\
&= g(F_v) \sum_{k=0}^{n+1} A_k \sum_{S \subseteq P'(v): |S| = k} \prod_{u \in S} q_u.
\end{aligned}
$$

The above expression matches the induction hypothesis, completing the proof.
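Equation (3.9) can also be sanity-checked by brute force for a nonlinear overlap function. The sketch below assumes that the parents' follow events are independent, which is what the product form in (3.9) implicitly uses, and it relies on the observation (not stated in the text) that the recurrence of Lemma 3.1.9 is satisfied by the signed binomial coefficients $a_i^k = (-1)^{k-i}\binom{k}{i}$. All names and numerical values are hypothetical.

```python
from itertools import combinations, product
from math import comb, prod


def q_bruteforce(g_v, f, q_parents):
    """Expected follow probability by enumerating all parent outcomes."""
    total = 0.0
    for outcome in product([0, 1], repeat=len(q_parents)):
        # probability of this joint outcome under independence
        p = prod(q if x else 1 - q for x, q in zip(outcome, q_parents))
        total += p * f(sum(outcome))  # overlap = number of parents that followed
    return g_v * total


def q_formula(g_v, f, q_parents):
    """Right-hand side of (3.9) with a_i^k = (-1)**(k - i) * C(k, i)."""
    n = len(q_parents)
    total = 0.0
    for k in range(n + 1):
        a_k = sum((-1) ** (k - i) * comb(k, i) * f(i) for i in range(k + 1))
        total += a_k * sum(prod(s) for s in combinations(q_parents, k))
    return g_v * total


# Hypothetical saturating overlap function and parent follow probabilities.
f = lambda phi: 1 - 0.5 ** (phi + 1)
lhs = q_bruteforce(0.4, f, [0.2, 0.5, 0.7])
rhs = q_formula(0.4, f, [0.2, 0.5, 0.7])
```

The two values agree up to floating point error, which is consistent with the inclusion-exclusion structure that the induction argument establishes.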


The expression (3.6) weighs each possible follow path $T$ of length $l$ by $\beta^l w_T$. The
expected number of follows is obtained by summing over all possible paths in the
DAG. If we look at each term corresponding to a path length $l$ in the expression
for $J(G)$ we can gain a better understanding of how the overlap plays a role in the
expected follows. For each value $l$ we define

$$J^l(G) = \alpha\beta^l \sum_{v \in V} \sum_{T \in \mathcal{P}_l(v,G)} w_T \qquad (3.13)$$

as the contribution to the total expected follows $J(G)$ coming from a specific value of
$l$ in the sum in equation (3.6). With this notation, we can rewrite equation (3.6) as

$$J(G) = \sum_{l=0}^{N-1} J^l(G). \qquad (3.14)$$

For $l = 0$ we have

$$J^0(G) = \alpha \sum_{v \in V} g(F_v).$$

That is, the zeroth order contribution to $J(G)$ is the sum of the vertex susceptibilities,
scaled by $\alpha$. This makes sense, as a vertex is a zero-hop path. The value $J^0(G)$ is the
expected follows that would be obtained if the overlap had no impact on the follow probability.
The first order term is

$$J^1(G) = \alpha\beta \sum_{(u,v) \in E} g(F_u) g(F_v),$$

which is simply the sum over all the edges in the graph, with edge weights given by
the product of the susceptibilities of their end vertices. Edges can also be thought of
as the one-hop paths in the graph. Continuing this way we obtain contributions from
higher order paths, up to paths of length $N - 1$ (the longest path possible in a graph
with $N$ vertices). A useful computational question is how many terms to keep in this
sum when actually calculating $J(G)$. This depends upon the density of
the graph and the values of the susceptibilities and overlap function. To understand
this question, let us assume that all susceptibilities are equal to some value $\gamma$. Let
us also define $N_l$ as the number of paths of length $l$ in a graph. With this notation,
$N_0$ is the number of vertices and $N_1$ is the number of edges. For ease of notation, we
define $N = N_0$. Under this scenario, all paths of equal length have the same weight.
Therefore, we can write $J^l(G)$ as

$$J^l(G) = \alpha\gamma (\beta\gamma)^l N_l.$$

The first term $\alpha\gamma$ is the same for all $l$ and can be ignored. The term $\beta\gamma < 1$ because
the follow probability must be less than or equal to one and we assume $\alpha > 0$. The
term $N_l$ depends upon the density of the graph. For a sparse graph $N_l$ will scale like
$N$, but for a dense graph we can have $N_l$ scaling like $N^l$. Therefore, for sparse graphs
$J^l(G)$ will decay exponentially fast in $l$ at rate $\beta\gamma$, while for denser graphs it will scale like
$(\beta\gamma N)^l$, which can decay or increase depending upon whether $\beta\gamma N$ is smaller or larger
than one. If the overlap is weak (small $\beta$) or the graph is small, then $\beta\gamma N$ will be less than one
and the higher order terms will be much smaller than the lower order terms. If the
overlap is strong (large $\beta$) or the graph is large, then higher order terms may not be
negligible.
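To see how quickly the terms $J^l(G)$ decay on a particular DAG under the uniform-susceptibility assumption, one can count the paths $N_l$ directly. The sketch below does this with a simple dynamic program over path endpoints; the function names and the example graph are hypothetical, and the loop only terminates when the input graph is acyclic.

```python
def path_counts(children):
    """Return [N_0, N_1, ...]: the number of directed paths with l edges in a DAG."""
    ends = {v: 1 for v in children}  # paths with 0 edges ending at each vertex
    counts = []
    while sum(ends.values()) > 0:
        counts.append(sum(ends.values()))
        nxt = {v: 0 for v in children}
        for u, kids in children.items():
            for w in kids:
                nxt[w] += ends[u]    # extend every path ending at u by edge (u, w)
        ends = nxt
    return counts


def order_contributions(children, alpha, beta, gamma):
    """J^l(G) = alpha * gamma * (beta * gamma)**l * N_l for uniform susceptibility gamma."""
    return [alpha * gamma * (beta * gamma) ** l * n_l
            for l, n_l in enumerate(path_counts(children))]


# Hypothetical diamond DAG: a -> b, a -> c, b -> d, c -> d.
diamond = {"a": ["b", "c"], "b": ["d"], "c": ["d"], "d": []}
counts = path_counts(diamond)
terms = order_contributions(diamond, alpha=0.1, beta=0.3, gamma=0.5)
```

Inspecting the cumulative sum of `terms` gives a direct way to choose a truncation order for a given graph.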
Chapter  4


Graph  Heuristic 

4.1  Optimal  Policy  on  a  Directed  Graph 

The linear extension policy for a DAG does not apply directly to a general directed graph
because, when cycles are present, more than one DAG can be formed from the graph.
However, the DAG analysis suggests that whatever the optimal policy is, it must not
violate the order relations imposed by the graph. In fact, a simple application of
Lemma 3.1.2 gives us the following result, which we state here without proof.

Lemma 4.1.1. Let $\pi^*$ be the optimal policy for an arbitrary directed graph $G = (V, E)$
and let $u, v \in V$. If there is a directed path in $G$ from $u$ to $v$, but not from $v$ to $u$,
then $u$ must come before $v$ in $\pi^*$.

This lemma is a generalization of Theorem 3.1.1 to arbitrary graphs. It says
that if there is a path between two vertices in one direction, but not in the reverse,
then the vertex at the start of the path should always be interacted with before the other.
Furthermore, because the optimal policy must respect the order imposed by the graph,
the optimal policy should be a linear extension of a DAG of the underlying graph.
Because all linear extensions of a DAG are optimal, finding the optimal policy reduces
to finding the optimal DAG. Formally, let $\mathcal{D}(G)$ be the set of all possible DAGs of
$G$. The optimal DAG is the one that maximizes the expected follows. That is, the
optimal DAG is the solution to the following optimization problem.

$$\max_{D \in \mathcal{D}(G)} J(D). \qquad (4.1)$$

There are some features the optimal DAG should have. Consider the expression for
$J(G)$ for a DAG $G$ from equation (3.14). The term $J^0(G)$ is independent of the DAG
chosen as long as all vertices are included. However, $J^1(G)$ will be larger if there
are more edges with heavier weights (product of end vertex susceptibilities). The
same goes for larger values of $l$. This suggests that when searching for the optimal
DAG, one would like to have as many “heavy” edges as possible. In the case when
the susceptibilities are uniform, this reduces to having a DAG with as many edges as
possible. This is the maximum acyclic subgraph problem, which is equivalent to the
minimum feedback arc set problem and is well known to be NP-hard to solve [23].
Therefore, finding an exact solution to (4.1) may not be
easily done. However, an exactly optimal solution may not be necessary, as the higher
order terms in $J(G)$ may not be significant, depending upon the graph structure and
follow probabilities. Instead, heuristic solutions may be sufficient in practice. We
will look at such heuristic policies next.

Our analysis thus far has shown that the optimal policy on a graph should correspond
to a DAG of the underlying graph. We have also seen how to calculate expected
follows on a DAG from Theorem 3.1.7. In addition, by studying the structure of the
expression for the expected follows we have seen that more edges in the DAG can lead
to increased expected follows. Combining these results we now propose a heuristic
policy for an arbitrary directed graph $G$. Give each vertex $v \in V$ a weight $g_v$. Weight
each edge of $G$ by the product of the weights of its incident vertices, so that an edge
$(i, j)$ has weight $g_i g_j$. Then form a DAG from $G$ as follows. Let $B$ be the maximum
branching forest of $G$. Recall that a maximum branching forest for a directed graph is a
subgraph such that each vertex has at most one parent and the edges included in the
subgraph are of maximal total weight. Then add edges to $B$ with the following heuristic.

Create a list $E$ of the edges of $G$ that are not in the maximum branching.
While this list is non-empty, do the following. Select the vertex $v$ of $B$ with
smallest in-degree, ties broken arbitrarily. Then select the edge from $E$ incident to $v$
with maximal weight. Update $B$ by adding this edge to the graph. If the updated
graph contains a cycle, remove the newly added edge. Remove the edge from $E$
and continue. Each edge added has a chance of creating a cycle, and we would like to
add as many edges as possible to the graph. By attempting to append the edges in
a smart manner, we aim to reduce the chance that each new edge forms a cycle.

The study of directed acyclic graphs (DAGs) overlaps with the study of partially
ordered sets, since a partial order may be imposed on the nodes of a DAG. Constraining
the graph to be acyclic ensures that it still constitutes a valid partial
order. Reference [33] discusses the process of “refining” a partial order. By appending edges
from the parent “friends” graph $G$ onto $B$, the heuristic is refining the partial order
imposed on the nodes of the graph, making it more precise. We are also getting a
better estimate of the true expected follows of a linear extension policy.

Let us refer to the DAG produced as a result of this procedure as $D_v$. We will define
the follow-back score of $D_v$ to be the expected follows on the DAG, i.e. $J(D_v)$, as
given by equation (3.6). The heuristic attempts to incorporate edges in a smart way
to increase the expected follows of the DAG. Our follow-back policy is then to
interact with the vertices of $G$ in a sequence that is a linear extension of this DAG.
While this heuristic is not guaranteed to produce the optimal DAG, we will see in
Section 4.2 that the policies we obtain perform well.


4.2  Results 


In this section we compare our follow-back score heuristic policy to several other
policies on real Twitter graphs. The policies we compare with are based on various
network centrality measures. We use simulations to evaluate the policies with follow
probabilities given by the results of our empirical analysis.
ALGORITHM 1: Follow-back

Input: Target friend graph G = (V, E);
       Maximal branching B = (V, E_B) of G;
while E ≠ ∅ do
    Order E;
    e = E[0];
    E = E \ {e};
    if e ∉ E_B then
        E_B = {e} ∪ E_B;
        if B is cyclic then
            E_B = E_B \ {e}
        else
            continue
        end
    else
        continue;
    end
end
Output: DAG B.
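A minimal sketch of the edge-appending step of Algorithm 1 is given below, under several assumptions: the maximum branching is supplied as an input rather than computed, “incident to $v$” is interpreted as edges pointing into $v$, and the ordering of the candidate list is by in-degree of the head vertex and then by edge weight $g_i g_j$. The function names and the example are hypothetical and this is not the thesis implementation.

```python
def creates_cycle(edges, new_edge):
    """True if adding new_edge = (u, v) closes a directed cycle (v already reaches u)."""
    u, v = new_edge
    children = {}
    for a, b in edges:
        children.setdefault(a, []).append(b)
    stack, seen = [v], set()
    while stack:
        x = stack.pop()
        if x == u:
            return True
        if x not in seen:
            seen.add(x)
            stack.extend(children.get(x, []))
    return False


def follow_back_dag(edges, g, branching):
    """Greedily append edges of G to the branching while keeping the graph acyclic."""
    dag = set(branching)
    candidates = [e for e in edges if e not in dag]
    while candidates:
        in_deg = {v: 0 for v in g}
        for _, head in dag:
            in_deg[head] += 1
        # "Order E": lowest in-degree head first, then heaviest weight g_i * g_j
        candidates.sort(key=lambda e: (in_deg[e[1]], -g[e[0]] * g[e[1]]))
        e = candidates.pop(0)          # the edge leaves the list whether or not it is kept
        if not creates_cycle(dag, e):
            dag.add(e)
    return dag


# Hypothetical example: a 3-cycle a -> b -> c -> a plus the chord a -> c.
g = {"a": 0.5, "b": 0.4, "c": 0.3}
edges = [("a", "b"), ("b", "c"), ("c", "a"), ("a", "c")]
dag = follow_back_dag(edges, g, branching=[("a", "b"), ("b", "c")])
```

Checking for a cycle before insertion is equivalent to adding the edge and then removing it when a cycle appears, and it avoids mutating the graph twice.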


4.2.1  Simulation  Methodology 

We evaluate the performance of a policy on a Twitter graph as follows. We simulate
an interaction with the first vertex in the policy. The result of this interaction is
given by a Bernoulli random variable that is one with probability given by the follow
probability from 3.1.1. We then update the overlap of any other vertex in the graph
as a result of this interaction. In particular, if this vertex follows back, then all of
its children in the graph increase their overlap by one. We note that all edges of
the graph are used to simulate the follows. We repeat this process for each vertex in
the policy until all vertices have been interacted with. When we reach the end of the
policy, we evaluate how many follows were received. We repeat this process for 10,000
trials to obtain the distribution of the follows of the policy. For our follow probability
model, we use the coefficients learned in our regression 2.4. However, we decided
to remove the coefficient $B_0$ from the model in simulation. The reason we did this
was to inflate the prior follow probabilities of each target. Since the agents that we
used to learn the follow probability model used the Twitter programming interface to
automatically post content, we assumed that their ability to gain followers would be
significantly less than that of a human operator who could control the account and interact
with greater skill.

Table 4.1: Twitter User Descriptions

User               Description
@billmcraven       Chancellor of University of Texas System
@DJ44              Chief Data Scientist of the United States Office of Science and Technology Policy
@McKinsey          Management consulting firm
@MIT               Institution of higher education
@ORCenter          Academic department within institution of higher education
@POTUS             United States President
@realDonaldTrump   Businessman and presidential candidate
@taylorswift13     Celebrity singer
@zlisto            Business school professor
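The simulation procedure of Section 4.2.1 can be sketched as follows. The follow probability is passed in as a callable so that any model can be used; the regression model of the thesis is not reproduced here, and the function names, the default seed, and the toy probability rule in the example are all assumptions made for illustration.

```python
import random


def simulate_policy(children, policy, follow_prob, rng):
    """Run one trial: interact in `policy` order and return the number of follows."""
    overlap = {v: 0 for v in policy}  # assumes the policy covers every vertex
    follows = 0
    for v in policy:
        if rng.random() < follow_prob(v, overlap[v]):  # Bernoulli draw
            follows += 1
            for w in children.get(v, []):
                overlap[w] += 1  # children of a follower gain one unit of overlap
    return follows


def mean_follows(children, policy, follow_prob, trials=10_000, seed=0):
    """Average follows over repeated independent trials."""
    rng = random.Random(seed)
    total = sum(simulate_policy(children, policy, follow_prob, rng)
                for _ in range(trials))
    return total / trials


# Hypothetical chain a -> b -> c with a deterministic toy rule:
# "a" always follows, every other vertex follows only with positive overlap.
chain = {"a": ["b"], "b": ["c"], "c": []}
toy_rule = lambda v, overlap: 1.0 if (v == "a" or overlap > 0) else 0.0
forward = mean_follows(chain, ["a", "b", "c"], toy_rule, trials=10)
backward = mean_follows(chain, ["c", "b", "a"], toy_rule, trials=10)
```

With this toy rule the forward ordering collects every vertex while the reversed ordering collects only the root, which illustrates why the interaction order matters.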


4.2.2  Twitter  networks 

Our test graphs for the simulation analysis are the “friends graphs” of nine Twitter
users, spanning different types of accounts and levels of public notoriety. We
list brief descriptions of each user in Table 4.1. Their friends graphs have very different
structural properties, which we list in Table 4.2. Friends graphs for celebrity users such
as @taylorswift13 tend to be more dense. This is because celebrities tend to follow
other celebrities who, in turn, follow one another. Also, the median follower count
of the vertices in celebrity friends graphs like @taylorswift13’s graph is two orders of
magnitude larger than in the graphs of typical users like the MIT academic department’s
account, @ORCenter. This is because most people @taylorswift13 follows are also celebrities
with large Twitter followings. The result of this is that we expect the susceptibilities
to be much lower for the vertices in celebrities’ graphs than in typical users’ graphs.
This means that @taylorswift13’s graph will be harder to penetrate and we expect to
get fewer follows compared with a graph such as that of @ORCenter.


4.2.3  Network  centrality  policies 

We compare our follow-back score policy to several other network centrality based
policies. Network centralities are functions that map a vertex in a graph to a real
number. There are several different network centralities to choose from, such as degree
centrality, eigenvector centrality, or betweenness centrality. For each network
centrality, we use the following policy: rank the vertices in order of increasing or
decreasing network centrality. The intuition for this policy comes from the fact that
network centralities give scores to vertices that reflect their importance with respect to
different graph metrics. Our network centrality policies simply interact with the vertices in
either increasing or decreasing importance with respect to the chosen network centrality.

In addition to the network centrality based policies, we also look at a random
policy, where the policy is a random permutation of the vertices. This policy serves
as a baseline and represents how many follows would be achieved without using any
information about the graph structure or vertex susceptibility.
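As an illustration of how such baseline policies can be generated, the sketch below orders vertices by total degree and also produces a random permutation. Degree (in-degree plus out-degree) is only one possible centrality, and the function names and example graph are hypothetical.

```python
import random


def degree_centrality_policy(edges, vertices, decreasing=True):
    """Order vertices by total degree (in-degree plus out-degree)."""
    degree = {v: 0 for v in vertices}
    for u, v in edges:
        degree[u] += 1
        degree[v] += 1
    # sorted() is stable, so ties keep the order in which vertices were supplied
    return sorted(vertices, key=lambda v: degree[v], reverse=decreasing)


def random_policy(vertices, seed=None):
    """Baseline policy: a uniformly random permutation of the vertices."""
    policy = list(vertices)
    random.Random(seed).shuffle(policy)
    return policy


vertices = ["a", "b", "c", "d"]
edges = [("a", "b"), ("a", "c"), ("b", "c"), ("a", "d")]
high_first = degree_centrality_policy(edges, vertices, decreasing=True)
low_first = degree_centrality_policy(edges, vertices, decreasing=False)
baseline = random_policy(vertices, seed=1)
```

Other centralities can be substituted by replacing the `degree` dictionary with any vertex score.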


4.2.4  Simulation  Results 

We test the different policies (network centralities, follow-back algorithm, and random)
on the friends graphs of the users listed in Table 4.1. The results are shown in Table
4.3, where each number is the average number of follows gained in the user’s friends
graph over 10,000 simulations. As can be seen, the follow-back score policy is frequently
one of the top performing policies, and there is a very small difference between
the expected follows of the top policies. Also, “smart” policies achieve as much as
10% more follows than their inverse order. This shows the scale of gain possible
using intelligent interaction policies. We note that the gain over random is roughly
the same for most graphs. This is not surprising if one considers the form of the
expression for the expected follows. The random policy DAG most likely does not
contain many edges. The expected follows of this policy would be close to $J^0(G)$.
The follow-back score policy creates a DAG with more edges, so to first order its
value can be expressed as $J^0(G) + J^1(G)$. The fractional gain of the follow-back score
policy over random is then approximately given by $J^1(G) / J^0(G)$. Expanding the ratio
of these quantities we find

$$\frac{J^1(G)}{J^0(G)} = \frac{\beta \sum_{(u,v) \in E} g(F_u) g(F_v)}{\sum_{v \in V} g(F_v)}. \qquad (4.2)$$

We see that this ratio is proportional to $\beta$. In our simulations, the value of $\beta$ is 0.29.
If we assume that vertex susceptibilities all equal $\gamma$, then the ratio simplifies to

$$\frac{J^1(G)}{J^0(G)} = \beta\gamma \frac{|E|}{|V|}. \qquad (4.3)$$

For very dense graphs, there should be more gain compared to sparser graphs. Also,
the gain should be higher for graphs where the vertices have a higher susceptibility.
This is what we observe in Table 4.3. The gains are roughly the same for both types of
graphs because while the potential for gain in denser graphs is higher, denser graphs tend to
be associated with “celebrity” users who have lower susceptibilities. Sparser graphs,
on the other hand, are more commonly associated with typical users who tend to
have higher susceptibilities. Therefore, these effects combine to give similar gains for
a majority of the graphs. One reason why many of the network centrality policies
perform well is that their policies are highly correlated.
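To illustrate how density and susceptibility can offset each other, consider the uniform-susceptibility ratio $\beta\gamma\,|E|/|V|$ with $\beta = 0.29$ and two sets of purely hypothetical values (these are not measurements from the Twitter graphs): a dense, low-susceptibility graph with $\gamma = 0.1$ and $|E|/|V| = 5$, and a sparse, high-susceptibility graph with $\gamma = 0.25$ and $|E|/|V| = 2$.

$$0.29 \times 0.1 \times 5 = 0.145, \qquad 0.29 \times 0.25 \times 2 = 0.145.$$

Under these assumed values both graphs would show a first order gain of roughly 14.5% over the random policy, even though their structure is very different.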
Chapter  5


General  Followback 

5.1  Set  up  and  Motivation 

We  now  look  at  a  more  general  version  of  the  follow-back  problem.  Given  an  arbitrary 
directed  graph,  an  agent  might  want  to  gain  follows  from  some  subset  of  targets 
within  the  graph.  This  situation  is  likely  in  marketing  or  advertisement  campaigns. 
Presumably,  there  are  already  individual  users  or  other  companies  within  the  Twitter 
network  that  are  deemed  authority  figures  on  the  company’s  product  or  area  of  focus. 
These prominent users probably have large followings that the company running
the advertising campaign wishes to reach. If the company can gain followers among
the “authority-like” users, it will have access to these users’ pre-existing follower bases.
This  is  precisely  the  problem  that  we  address  in  this  chapter:  when  the  agent  seeks 
to  gain  follows  from  multiple  targets  simultaneously  and  is  permitted  to  interact  with 
the  friends  of  the  targets. 

5.2  Integer  Program 

We use constrained integer programming to formulate the problem. We add a constraint
that limits the number of interactions that the agent is allowed, which makes
the problem more practically applicable. Despite the increased complexity of the
problem, we use the intuition gained from earlier analysis to formulate our objective.




We seek to find a directed acyclic subgraph with the maximum number of paths of
maximally weighted vertices that lead into the targets, subject to the number of vertices
in the subgraph being at most the maximum number of interactions allowed. In
the basic problem where all of the vertices in the graph are friends of the target, we
found that deriving an optimal policy involved selecting a directed acyclic subgraph
of maximal followback score. Recall that the followback score was calculated by summing,
over all paths in the graph, the product of the vertex susceptibilities along each path,
as shown in equation (3.6). The objective function in our integer program is doing the same
thing; however, since a general graph might include vertices that are not friends of
the targets, we want to restrict our focus to paths that specifically end on one of
the targets. Since the acyclic subgraph constitutes a valid partial order, we can, as
before, derive an optimally sequenced interaction policy by taking a linear extension
of the graph.


For an arbitrary directed graph $G$ of $N$ vertices, give each vertex $v \in V$ a weight
$g_v$. Then form a DAG $G^*$ from $G$ using the integer program, relying upon notation
established in Section 3.1.5. The objective function corresponds to equation (3.6) and the
constraints limit the maximum number of interactions that the agent is allowed to $c$, ensure that
the graph is acyclic, and guarantee that selected edges have incident vertices that
are also selected. This DAG is optimal for the problem, and Section 4.1 showed us that any
linear extension of the DAG is an optimal interaction sequence.


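$$x_i = \begin{cases} 1, & \text{if } v_i \in G^* \\ 0, & \text{otherwise} \end{cases} \qquad (5.1)$$

$$y_{i,j} = \begin{cases} 1, & \text{if } e_{i,j} \in G^* \\ 0, & \text{otherwise} \end{cases} \qquad (5.2)$$

$$
\begin{aligned}
\max \quad & \sum_{i \in V} g_i x_i + \sum_{t \in R} \sum_{l=1}^{N-1} \sum_{T \in \mathcal{P}_l(t,G)} \beta^l \prod_{u \in T} g_u \prod_{(i,j) \in T} y_{i,j} \\
\text{subject to} \quad & \sum_{i \in V} x_i \le c \\
& y_{i,j} \le E_{i,j} \quad \forall (i,j) \in E \\
& \sum_{(i,j) \in C} y_{i,j} \le |C| - 1 \quad \forall C \\
& x_i - y_{i,j} \ge 0 \quad \forall i, j \\
& x_j - y_{i,j} \ge 0 \quad \forall i, j \\
& x_i \in \{0, 1\} \\
& y_{i,j} \in \{0, 1\}
\end{aligned}
\qquad (5.3)
$$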

5.3  Tractability  Issues

Our integer program is highly nonlinear. The nonlinear terms come about from the
inclusion of higher order paths in the objective function. The authors are not aware
of any solvers that would be able to handle such a problem in reasonable time. We
will not analyze the complexity of the general integer program, but rather choose to
focus on viable integer-programming based heuristics that exhibit good performance.

Recall that the objective function (5.3) sums over all of the paths in the DAG and
weights them by the vertex susceptibilities along the path as well as a constant term
that decreases exponentially in path length. We consider all weighted paths of length
$l$ included in the objective function to be the $l$th order contribution to the objective value.
Note that (5.3) includes paths of all lengths, which, for an $N$ node graph, could be as
large as the $(N - 1)$th order. The higher order terms in the function give rise to the
high nonlinearity of the function since they must include products of the binary edge
variables $y_{i,j}$ for all edges $(i, j)$ in each path.
Figure 5-1: Contribution to objective value by order for two accounts (panels: ORC, zlisto).


In Figure 5-1 we explore the contribution to the objective value that each order term
provides for two Twitter graphs that were small enough to allow us to calculate the expected
follows completely on their induced DAGs. Approximately 90% of the objective
value for the zlisto account is captured by the zero and first order contributions, and
more than 99% of the objective value is captured once the second order is included.
On the other hand, the first order contribution only captures approximately
25% of the objective value for the ORC account and 99% of the value is not captured
until the seventh order. We did not find these results surprising since the ORC friends
graph is 100x more dense than the zlisto friends graph. In general, we expect more
dense graphs to have a greater proportion of their objective value captured by higher
order paths, since a greater number of longer paths exist in these graphs.
5.4  Integer  Program  based  heuristic


5.4.1  Minimum  Feedback  Arc  Set 

Despite the impact that higher order terms have on the objective value for dense
graphs, we decide to truncate the objective function after the second order terms. We
argue that optimizing over lower order terms exclusively is acceptable firstly because
of tractability. Solving our objective to optimality in the second order is called the
minimum feedback arc set problem and is well known to be NP-hard to solve [23].
There is an abundance of research on the problem, and methods to solve it using
integer programming exist [2]. We also justify focusing on lower order terms in
our optimization because we saw that for low density graphs, lower order terms carry
a high proportion of the objective value (5.3), and many of the social media networks
of interest will be of low density.


5.4.2  Formulation 


We use graph centrality measures as a surrogate to capture some of the granularity
in objective value lost by ignoring the higher order terms. Given a graph centrality score $c_i$ for
vertex $i$, we reformulate the integer program as follows. Note that we decrease the
nonlinearity in the objective by optimizing over one- and two-hop paths alone.


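\[
x_i =
\begin{cases}
1 & \text{if } v_i \in G^* \\
0 & \text{otherwise}
\end{cases}
\tag{5.4}
\]

\[
y_{i,j} =
\begin{cases}
1 & \text{if } e_{i,j} \in G^* \\
0 & \text{otherwise}
\end{cases}
\tag{5.5}
\]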
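\[
\begin{aligned}
\max_{x,\,y} \quad & \sum_{i \in V} \left( g_i x_i + c_i x_i \right) + \sum_{t \in R} \sum_{l=1}^{2} \sum_{T \in P_l(t,G)} \beta^{l} \prod_{u \in T} g_u \prod_{(j,i) \in T} y_{j,i} \\
\text{subject to} \quad & \sum_{i \in V} x_i \le c \\
& y_{i,j} \le E_{i,j} && \forall (i,j) \in E \\
& \sum_{(i,j) \in C} y_{i,j} \le |C| - 1 && \forall C \\
& x_i - y_{i,j} \ge 0 && \forall i,j \\
& x_j - y_{i,j} \ge 0 && \forall i,j \\
& x_i \in \{0,1\} \\
& y_{i,j} \in \{0,1\}
\end{aligned}
\tag{5.6}
\]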
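
The cycle constraints in (5.6) limit the number of selected edges on every cycle to one fewer than the cycle length, which forces the selected edges to form an acyclic subgraph. As a minimal sketch, the following Python function checks that property for a candidate edge selection using Kahn's topological sort. The function name and the toy edge lists are hypothetical, and this is a verification aid under those assumptions, not part of the integer program itself.

```python
from collections import defaultdict, deque

def is_acyclic(selected_edges):
    """Kahn's algorithm: True if the selected directed edges contain no cycle."""
    successors = defaultdict(list)
    in_degree = defaultdict(int)
    vertices = set()
    for u, v in selected_edges:
        successors[u].append(v)
        in_degree[v] += 1
        vertices.update((u, v))
    # Start from vertices with no incoming selected edge.
    queue = deque(v for v in vertices if in_degree[v] == 0)
    visited = 0
    while queue:
        u = queue.popleft()
        visited += 1
        for v in successors[u]:
            in_degree[v] -= 1
            if in_degree[v] == 0:
                queue.append(v)
    # Every vertex is visited exactly when no cycle blocks the sort.
    return visited == len(vertices)

cycle_edges = [(1, 2), (2, 3), (3, 1)]  # violates the cycle constraint
dag_edges = [(1, 2), (2, 3)]            # drops one edge of the cycle
```

Dropping any single edge of the three-cycle satisfies the constraint for that cycle, which is what the second toy edge list illustrates.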


The matrix associated with the second order portion of the objective is not generally
positive semi-definite. To ensure that optimization solvers can handle the
formulation, we seek to make the objective linear. We add additional binary variables
$z_{i,j,k}$ associated with the two-hop paths $(i,j,k)$. This avoids the nonlinearity in (5.6)
that arises when we multiply $y_{i,j} y_{j,k}$ to represent such a path. At the same time, it
substantially increases the number of binary variables in our formulation: for an $N$-vertex
graph, this yields on the order of $N^3$ binary variables.
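\[
\begin{aligned}
\max_{x,\,y,\,z} \quad & \sum_{i \in V} \left( g_i x_i + c_i x_i \right) + \sum_{t \in R} \sum_{i \in V} \Big( \beta g_i g_t y_{i,t} + \beta^{2} \sum_{j \in V} g_i g_j g_t z_{i,j,t} \Big) \\
\text{subject to} \quad & \sum_{i \in V} x_i \le c \\
& y_{i,j} \le E_{i,j} && \forall (i,j) \in E \\
& \sum_{(i,j) \in C} y_{i,j} \le |C| - 1 && \forall C \\
& x_i - y_{i,j} \ge 0 && \forall i,j \\
& x_j - y_{i,j} \ge 0 && \forall i,j \\
& z_{i,j,k} - y_{i,j} \le 0 && \forall i,j,k \\
& z_{i,j,k} - y_{j,k} \le 0 && \forall i,j,k \\
& z_{i,j,k} - y_{i,j} - y_{j,k} + 1 \ge 0 && \forall i,j,k \\
& x_i \in \{0,1\} \\
& y_{i,j} \in \{0,1\} \\
& z_{i,j,k} \in \{0,1\}
\end{aligned}
\tag{5.7}
\]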
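
The three constraints on the two-hop variable in (5.7) follow the standard linearization of a product of two binary variables: z is at most each edge variable and at least their sum minus one. As a small sanity check, the Python sketch below enumerates all binary edge values and lists which values of z remain feasible. The function and variable names are hypothetical and chosen only for this illustration.

```python
from itertools import product

def feasible_z(y_ij, y_jk):
    """Binary z values allowed by the three linearization constraints."""
    return [
        z for z in (0, 1)
        if z - y_ij <= 0                # z cannot exceed the first edge variable
        and z - y_jk <= 0               # z cannot exceed the second edge variable
        and z - y_ij - y_jk + 1 >= 0    # z must be 1 when both edges are selected
    ]

# Map every combination of edge variables to the z values it permits.
linearization_table = {
    (y_ij, y_jk): feasible_z(y_ij, y_jk)
    for y_ij, y_jk in product((0, 1), repeat=2)
}
```

For every combination the only feasible value is the product of the two edge variables, so the linear constraints reproduce the bilinear term exactly at binary points.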


5.5  Simulation 

5.5.1  Network 

We simulate our integer programming heuristic for the general follow-back problem on
a Twitter friends network of three famous computer scientists: Yann LeCun, Andrew
Ng, and Sebastian Thrun. We use the Twitter application programming interface (API) to
compile the collective “friends graph,” which includes the three users, the
union of their friends, and the friend/follower connections among all of them. The properties
of the graph are listed in Table 5.1. We use the same simulation methodology described in Section 4.2.1.

Table 5.1: Computer Scientist Friends Graph

Property                        Value
------------------------------  ------------------------------------
Friends graph                   @ylecun, @SebastianThrun, @AndrewYNg
Number of targets               3
Number of vertices              218
Number of edges                 3140
Density                         0.07
Mean degree                     14.40
Max out degree                  66
Min out degree                  1
Max in degree                   113
Min in degree                   0
Number weak components          1
Largest component diameter      6
Sum of vertex susceptibilities  46.77
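
The density and mean degree in Table 5.1 can be recovered from the vertex and edge counts. The sketch below assumes that density is the number of edges divided by the number of ordered vertex pairs and that mean degree is the number of edges per vertex. Both definitions are assumptions made here, although they reproduce the tabulated values after rounding.

```python
def directed_density(num_vertices, num_edges):
    """Fraction of possible directed edges (ordered pairs) that are present."""
    return num_edges / (num_vertices * (num_vertices - 1))

def mean_degree(num_vertices, num_edges):
    """Average number of edges per vertex."""
    return num_edges / num_vertices

# Counts taken from Table 5.1.
density = directed_density(218, 3140)
avg_degree = mean_degree(218, 3140)
```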

5.5.2  Policies 

We use the linearized integer program (5.7) to solve for the optimal policies. For this
simulation, we set c = 20, implying that the agent is allowed to interact with at most
twenty users in the friends graph. We derive one policy (Second Order) that optimizes
over the first and second order paths in the objective but does not consider vertex or
edge centralities. We derive another policy (First Order) that optimizes over the first
order paths exclusively. We then experiment with different centrality scores $c_i$, such
as in-degree centrality, out-degree centrality, eigenvector centrality, and edge-betweenness
centrality. In the edge-betweenness case we modify our objective, replacing the term
$c_i x_i$ with $c_{i,j} y_{i,j}$, where $c_{i,j}$ is the edge betweenness score of the edge
between $i$ and $j$ and $y_{i,j}$ is still the binary edge variable associated with $(i,j)$.
We normalize each centrality score so that the scores sum to one. For each vertex
centrality measure, we also derive a policy that excludes
the second order term from the objective and simulate with the resulting interaction
sequences. So, for example, the policy “Eigenvector First-Order” selected a partial
order and optimized over the vertex weights of the targets, the eigenvector centrality
regularizer, and the first order paths of the objective function alone.
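
As an illustration of the normalization step, the following Python sketch computes in-degree scores for a small directed graph and rescales them to sum to one. The function name and the toy graph are hypothetical. Other centrality measures would replace only the counting step, and the rescaling would stay the same.

```python
from collections import Counter

def normalized_in_degree(vertices, edges):
    """In-degree of each vertex, rescaled so that the scores sum to one."""
    counts = Counter(v for _, v in edges)
    total = sum(counts[v] for v in vertices)
    if total == 0:
        # A graph with no edges has no in-degree mass to distribute.
        return {v: 0.0 for v in vertices}
    return {v: counts[v] / total for v in vertices}

# Hypothetical toy graph with four directed edges.
toy_vertices = ["a", "b", "c"]
toy_edges = [("a", "b"), ("c", "b"), ("a", "c"), ("b", "a")]
scores = normalized_in_degree(toy_vertices, toy_edges)
```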

We compare the optimization-derived policies with two random policies: 1) a policy
(Random Append) that selects a random permutation of seventeen members of
the friends graph and then appends a random permutation of the three target users,
and 2) another policy (Random) that realizes a random permutation of twenty users
in the graph that must include the three targets. Note that the difference between
the two random policies is that the first one constrains the three targets to be at the
end of the sequence, whereas the second allows the three targets to fall anywhere
in the interaction sequence. In addition, we compare the integer programming
policies with policies that are based solely on network centrality measures. For
example, the policy “Eigenvector Centrality” takes the input friends graph, removes the
three targets, and orders the seventeen vertices with the greatest eigenvector centrality in
decreasing order of their scores. It then calculates the eigenvector centralities on the
three-vertex graph containing just the targets, orders the targets by decreasing eigenvector
centrality score, and appends this sequence to the already ordered list of seventeen
friends to make a sequence of twenty vertices. The results are listed in Table 5.2. We
simulate each policy 10,000 times and show the fraction of times that each user followed
back, as well as the sum of the three fractions. For each user, the maximum
possible value is 1, and the maximum possible value of the sum is 3. The a
priori follow probabilities for the three users are listed in the first row of Table 5.2.
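
The two random baselines can be sketched in a few lines of Python. The function names, the vertex labels, and the fixed seed are assumptions for this illustration. The count of 215 non-target users is the 218 vertices of Table 5.1 minus the three targets.

```python
import random

def random_append_policy(friends, targets, budget=20, rng=random):
    """Random non-target users first, then the targets in random order."""
    head = rng.sample(friends, budget - len(targets))
    tail = rng.sample(targets, len(targets))
    return head + tail

def random_policy(friends, targets, budget=20, rng=random):
    """Random order of `budget` users that is forced to include every target."""
    chosen = rng.sample(friends, budget - len(targets)) + list(targets)
    rng.shuffle(chosen)
    return chosen

# Hypothetical labels standing in for the real Twitter accounts.
targets = ["t1", "t2", "t3"]
friends = [f"u{k}" for k in range(215)]
rng = random.Random(0)  # fixed seed so that the example is repeatable
append_sequence = random_append_policy(friends, targets, rng=rng)
free_sequence = random_policy(friends, targets, rng=rng)
```

Both sequences contain twenty distinct users. Only the first guarantees that the three targets occupy the last three positions.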

As we would expect, the random policy that constrains the targets to fall at the
end of the sequence outperforms the random policy that allows the targets to appear
anywhere. Since the follow probabilities are increasing in overlap, we maximize the
follow probability of a user by being followed by his friends first. Most of the policies
that are based solely on centrality measures perform similarly to the random policies.
Of note, the policies based on in-degree centrality and eigenvector centrality perform
better than the other centrality-based policies. Eigenvector centrality performs roughly 18%
better than the next best centrality-based policy (a summed follow-back fraction of 0.630
versus 0.535 for in-degree centrality). Since eigenvector centrality weights
the importance of a node by the importance of its neighbors [7], we intuitively
understand that ordering interactions based upon this measure might help to capture
valuable paths in the graph that help increase overlap.
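
For reference, the centrality-only policies compared above can be sketched as follows: rank the non-target users by score, keep as many as the budget allows, and append the targets ranked by their own scores. The function name and the toy scores are hypothetical, and the sketch takes precomputed scores as input rather than computing any particular centrality.

```python
def centrality_policy(friend_scores, target_scores, budget=20):
    """Top-scoring friends in decreasing score order, then the targets likewise."""
    num_friends = budget - len(target_scores)
    ranked_friends = sorted(friend_scores, key=friend_scores.get, reverse=True)
    ranked_targets = sorted(target_scores, key=target_scores.get, reverse=True)
    return ranked_friends[:num_friends] + ranked_targets

# Hypothetical scores; in the text the target scores are computed separately
# on the graph that contains only the targets.
friend_scores = {"u1": 0.10, "u2": 0.40, "u3": 0.25, "u4": 0.05}
target_scores = {"t1": 0.2, "t2": 0.5}
sequence = centrality_policy(friend_scores, target_scores, budget=4)
```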

For each centrality measure, both the first-order and second-order integer programming
policies outperform the purely centrality-based policy. This is not surprising
since the integer programming (IP) policies take into account the a priori vertex
susceptibilities whereas the centrality policies do not. The IP policies bias towards
selecting vertices that have high prior follow probabilities. Some of the IP policies
that optimize over the centrality regularizer outperform the IP that optimizes over
first and second order paths exclusively, while others do not. Specifically, the eigenvector,
out-degree, in-degree, and edge betweenness regularizers perform better than the
path-only policy. Centrality regularizers can only add value to the integer program if
the correct centrality measure is selected.

Table 5.2: Computer Scientist Friends Graph Results

Policy                    @ylecun  @SebastianThrun  @AndrewYNg  Sum
------------------------  -------  ---------------  ----------  -----
Prior                     0.143    0.083            0.110       0.336
Random                    0.177    0.090            0.144       0.410
Random Append             0.260    0.087            0.156       0.503
Eigenvector Centrality    0.378    0.084            0.169       0.630
Eigenvector First-Order   0.443    0.087            0.151       0.682
Eigenvector Second-Order  0.407    0.082            0.155       0.643
In-Degree Centrality      0.263    0.086            0.186       0.535
In-Degree First-Order     0.421    0.084            0.165       0.670
In-Degree Second-Order    0.422    0.088            0.138       0.647
Closeness Centrality      0.164    0.086            0.139       0.389
Closeness First-Order     0.460    0.082            0.156       0.692
Closeness Second-Order    0.452    0.081            0.147       0.680
Out-Degree Centrality     0.177    0.090            0.144       0.410
Out-Degree First-Order    0.470    0.084            0.132       0.687
Out-Degree Second-Order   0.425    0.083            0.148       0.647
Load Centrality           0.205    0.078            0.156       0.439
Load First-Order          0.370    0.087            0.156       0.614
Load Second-Order         0.373    0.079            0.155       0.607
Edge Betweenness          0.4417   0.085            0.149       0.675
Second Order              0.442    0.086            0.143       0.670
First Order               0.447    0.083            0.140       0.670

There  is  little  difference  between  the  performance  of  the  first  order  IP  policies  and 
the  second  order  IP  policies.  The  first  order  policy  with  no  centrality  regularization 
performed  virtually  the  same  as  its  second  order  counterpart.  In  fact,  for  all  the 
centrality regularizers, the first order policies outperform the integer programs that
include  second  order  terms.  We  believe  this  might  have  taken  place  because  our 
approximation  for  calculating  expected  follows  assumes  that  the  follow  probability  is 
linear  in  overlap,  which  may  not  be  the  case.  Thus,  the  expression  might  be  over¬ 
counting  the  impact  that  second  order  terms  have  on  the  objective  value.  When 
added  to  the  first  order  objective,  vertex  centrality  regularizers  improved  the  policy’s 
performance  by  as  much  as  2%. 

These  findings  imply  that  including  second  order  terms  in  the  optimization  may 
not  be  essential  and  using  an  appropriately  selected  network  centrality  regularizer 
could  serve  as  a  valuable  surrogate  for  the  higher  order  terms  of  the  problem.  This 
result  is  particularly  valuable  when  using  integer  programming  to  solve  for  follow-back 
policies on large scale graphs. By removing the second-order term from the objective,
we can reduce the size of our problem from roughly $N^3$ binary variables to roughly $N^2$,
significantly increasing tractability.
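
To give a sense of scale, the sketch below counts binary variables for the 218-vertex graph of Table 5.1. It assumes one vertex variable per vertex, one edge variable per ordered pair of vertices, and one two-hop variable per ordered triple. That dense indexing is an assumption made here and gives upper bounds, since an implementation could create variables only for edges and paths that exist in the graph.

```python
def binary_variable_counts(num_vertices):
    """Upper bounds on variable counts under dense indexing (an assumption)."""
    n = num_vertices
    return {"x": n, "y": n ** 2, "z": n ** 3}

counts = binary_variable_counts(218)
with_second_order = sum(counts.values())      # x, y, and z variables
first_order_only = counts["x"] + counts["y"]  # z variables dropped
```

Under these assumptions, dropping the two-hop variables shrinks the program from about ten million binary variables to fewer than fifty thousand.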
Chapter 6


Future  Work  and  Conclusions 

6.1  Future  Work 

After  determining  an  interaction  sequence,  an  agent  would  want  to  know  how  to  time 
his  interactions.  For  instance,  how  long  after  interacting  with  the  first  user  in  the 
sequence  should  the  agent  wait  until  he  interacts  with  the  second  user,  and  so  on? 
Should  the  agent  wait  until  the  first  user  decides  to  follow  him?  What  if  this  never 
occurs?  The  agent  should  not  be  expected  to  wait  indefinitely.  What  if  there  is  a 
cost  per  unit  time  that  is  incurred  while  the  targets  are  not  following  the  agent?  If 
the  agent  does  not  gain  the  targets  as  followers  after  some  period  of  time,  perhaps 
the  targets  decide  to  follow  a  competitor.  The  longer  the  agent  takes  to  acquire  the 
targets  as  followers,  the  greater  the  chance  that  they  are  acquired  by  a  competitor. 
If  the  agent  is  aware  that  a  user  will  take  too  long  to  follow-back,  he  might  decide 
not  to  waste  time  interacting  with  that  user. 

6.1.1  Timing  Experiment 

To incorporate timing into our optimization, we must first understand whether, and which,
features of a Twitter user’s behavior are indicative of how quickly he will follow back.
We conducted one final experiment with the same agents we used in Chapter 2.

Table 6.1: Follow-back Delay

Feature              β      p-value
-------------------  -----  -------
Intercept            2.38   0.07
Number of tweets     -4.97  0.67
Percentage retweets  8.26   0.62
Percentage hashtag   6.01   0.73
Percentage media     6.39   0.70

The significance codes are ***: 0.001, **: 0.01, and *: 0.05.

We targeted users that had tweeted about “Morocco” or “arts
and crafts” within the past twenty-four hours. Then, we had one of our six agents
interact with each of these users, where an interaction was a retweet and a follow. After
interacting with each user, we had the agent check its followers list each minute to
see if and when the user it interacted with followed the agent.
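
The polling procedure can be sketched as a simple loop. In the Python sketch below, `fetch_follower_ids` is a hypothetical callable that returns the agent's current follower ids. It stands in for whatever API client is used and is not a real Twitter API function. The one-minute interval matches the text, while the cap on the number of polls is an added assumption so that the loop terminates.

```python
import time

def wait_for_follow(user_id, fetch_follower_ids, poll_seconds=60,
                    max_polls=1440, sleep=time.sleep, clock=time.time):
    """Poll the follower list until `user_id` appears.

    Returns the observed delay in seconds, or None if the user has not
    followed back after `max_polls` checks. `sleep` and `clock` are
    injectable so the loop can be exercised without real waiting.
    """
    start = clock()
    for _ in range(max_polls):
        if user_id in fetch_follower_ids():
            return clock() - start
        sleep(poll_seconds)
    return None
```

The measured delay is accurate only to within one polling interval, which is acceptable when follow-back delays are much longer than a minute.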

Over a 1.5 week period, the agents interacted with 500 Twitter users. Of the 500
users, 64 followed the agents back. The 13% follow-back rate is consistent
with what we observed in Chapter 2. For each user that followed back, we gathered some of
the features of that user’s Twitter behavior, such as the number of tweets they
have posted in total, the percentage of these tweets that are retweets, the percentage
that contain pieces of media such as linked photos or videos, and the percentage
that contain hashtags. We then used linear regression to predict the amount of
time it took the users to follow back, regressing on these features. The results of the
regression are shown in Table 6.1.
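
As a minimal illustration of the regression step, the Python sketch below fits an ordinary least-squares line for a single feature in closed form. The experiment described here regressed on several features at once, so the single-feature version is a deliberate simplification. The data are synthetic values generated for this example and are not measurements from the experiment.

```python
def fit_simple_ols(xs, ys):
    """Least-squares intercept and slope for a single feature."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    slope = sxy / sxx
    intercept = mean_y - slope * mean_x
    return intercept, slope

# Synthetic illustration only: delays generated exactly as 1 + 2 * feature.
retweet_fraction = [0.0, 0.25, 0.5, 0.75, 1.0]
delay_hours = [1.0, 1.5, 2.0, 2.5, 3.0]
intercept, slope = fit_simple_ols(retweet_fraction, delay_hours)
```

Because the synthetic delays lie exactly on a line, the fit recovers the generating intercept and slope. Real follow-back delays would be noisy, which is where the p-values in Table 6.1 become relevant.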

None  of  the  features  we  checked  were  significant  in  predicting  the  delay  in  how 
long  it  takes  a  user  to  follow-back.  However,  we  believe  that  given  enough  data  and 
the  proper  experimental  set  up,  some  user  features  may  be  significant  in  predicting 
the  follow-back  delay.  We  conjecture  that  people  who  use  Twitter  more  frequently 
are  more  inclined  to  follow-back  faster,  as  they  will  likely  see  the  agent’s  interaction 
sooner.  A  future  line  of  research  could  involve  finding  viable  features. 

Modeling time delays between interactions would also increase the complexity
of our general follow-back dynamic programming problem presented in Section 1.5. Future
research could use the framework of continuous-time dynamic programming to analyze
the problem and gain insight. Incorporating timing decisions into the integer program
outlined in Chapter 5 would greatly increase its complexity and could make it intractable.
This is another area for future research.


6.1.2  Maintaining  Connections 

The  follow-back  problem  focused  on  establishing  a  connection  with  a  primary  target. 
However,  once  a  connection  is  established,  it  can  be  strengthened  through  other 
types  of  interactions  such  as  replying  or  retweeting.  A  natural  next  step  in  this 
work  is  to  develop  optimal  policies  for  maintaining  or  strengthening  connections  in  a 
social  network.  Further  empirical  analysis  will  be  needed  to  understand  the  dynamics 
of  repeated  interactions.  However,  once  a  model  for  the  effect  of  such  interactions 
on  a  social  connection  is  developed,  the  follow-back  problem  can  be  extended  to 
find  optimal  policies  for  maintaining  or  strengthening  connections  through  repeated 
interactions  in  a  social  network. 


6.2  Conclusion 

We have proposed the follow-back problem for targeting users in a social network.
We conducted empirical studies in Twitter to find what user features and interaction
types cause users to follow. Specifically, we identified the importance of an agent’s
local “overlap” with a target user in gaining the target’s follow. Using this analysis,
we constructed a simple model for the follow probability. We then used this model to
develop optimal policies for the follow-back problem. Through this we developed the
notion of a follow-back score on a directed acyclic graph.

We devised a graph theoretic algorithm to derive a follow-back policy on an
arbitrary graph, which performs well compared with other policies in simulation.
We also modeled our problem as a constrained integer program to select an optimal
directed acyclic subgraph of an arbitrary directed graph.

We truncated some of the higher order terms to make the problem tractable on
large scale Twitter networks. We used network centrality regularizer functions to
replace some granularity lost in the objective by truncating those higher order terms.
We showed in simulation that this approach performs well. Our work supports the
power of network centrality measures. When combinatorial optimization in graphs
becomes intractable, network centrality measures serve as useful functions that are
cheap to compute. Researchers may, at times, find it viable to discard intractable
portions of their math programs and use network centrality regularizers as helpful
surrogates.

6.3  Application 

Learning how to network with others and spread information has uses in sectors
of society ranging from defense to business. People have long understood the
skills needed to network and communicate effectively in the offline setting and have
recognized the importance of these skills. As the adoption of social media has made
networking and spreading information easier, organizations that want to remain
competitive must begin using more strategic ways of connecting with others online. If
they do not, they risk being drowned out in the ever increasing volume of content that
travels around cyberspace. Though the follow-back problem was framed over Twitter,
the intuition and modeling can be used to more effectively connect with users in other
types of social media networks. Our model is merely preliminary work in a rapidly
growing field.